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SPECIAL ISSUE IN HONOUR OF C. J. BOUWKAMP 
ON THE OCCASION OF HIS SIXTIETH BIRTHDAY 
AND HIS RETIREMENT AS SCIENTIFIC ADVISER OF 
PHILIPS RESEARCH LABORATORIES AND EDITOR-IN-CHIEF OF 
PHILIPS RESEARCH REPORTS 


Christoffel Jacob Bouwkamp was born at Hoogkerk, The Netherlands, on 
June 26, 1915. He studied at the University of Groningen, The Netherlands, 
where he graduated in theoretical physics, mechanics, and mathematics in 1938. 
After his military service he completed his doctoral thesis (under Е. Zernike) 
on diffraction theory, and gave courses on mechanics and quantum mechanics 
at the University of Groningen. 

In 1941 he joined the staff of Philips Research Laboratories, N.V. Philips’ 
Gloeilampenfabrieken, Eindhoven, The Netherlands. From 1946 he also was 
an Editor, and since 1952 the Editor-in-Chief of Philips Research Reports. In 
1969 he was appointed Scientific Adviser for Applied Mathematics at Philips 
Research Laboratories. 

He has been Professor Extraordinary of Applied Mathematics since 1955, 
first at the University of Utrecht, The Netherlands, until 1958, and since then 
at the Technological University Eindhoven. On several occasions he was 
Visiting Lecturer, Research Associate, and Research Mathematician at various 
institutes in Europe and the United States. He is a Member of the “Koninklijke 
Nederlandse Akademie van Wetenschappen" (Royal Dutch Academy of 
Sciences). 


Any centre of human activity — such as, for example, a research laboratory 
— is necessarily dependent on the form of its organization, on its budget, on the 
atmosphere, on its technical facilities, and — in short — on many matters over 
which the individual has little influence. 

When, therefore, an individual is lauded, praised or eulogized he is likely to 
reply in terms of: “I was only one of a team”, or “I stood on the shoulders of 
others”, or “I grasped the opportunity afforded me by others”. 

Yet we all know that such remarks express only a part of the truth, for the 
individual is indeed unique and he moulds the course of events. No one is 
irreplaceable but no individual can take the place of another, ten good math- 
ematicians do not make one great one, and the new vintage, however good, will 
not have the taste of the old. 

With effect from July 1, 1975 Bouwkamp the mathematician will no longer 
be in the ranks of the Philips Research Laboratories. 

With effect from that date Philips Research Reports has to be edited and 
guided by another hand. 

With effect from that date Bouwkamp the man will be absent from our 
corridors. 

An individual leaves us; we lose a voice, a talent, an adviser, a hobbyist and 
a master of his craft. 

Mathematics in general, this laboratory and this journal are greatly in his 
debt. We all wish to express our thanks but in no standardized manner and in 
no blaze of publicity. We were of the opinion that a special issue of Philips 
Research Reports to which he devoted so much of his enthousiasm, would be 
a good synthesis between the honour to an individual and the manifestation of 
what may be achieved by joint activity. 

We hope that the scientific world will appreciate this collective publication 
inspired by and dedicated to a colleague and friend. 

Many have contributed to this issue; Bouwkamp was not a contributor but 
without him it could not have been written. 


The Directors, 
Philips Research Laboratories 
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LASER MODES; AN IMPERFECT TRIBUTE 
TO C. J. BOUWKAMP 


by Н. B. G. CASIMIR | 


I feel rather ashamed that in an issue of the Philips Research Reports 
dedicated to C. J. PME Ihave notbing bener to offer than: some Sketehy 
considerations. 

The truth of the matter is that foi several years NS I have from time to 
time been thinking on a subject that, properly worked out, might have made a 
suitable contribution. I have even occasionally discussed the matter with 
Bouwkamp and he has taken me to task for not yet having filled in the analytical 
details. But somehow I have been too busy (or too làzy?) to do.this with a 
precision and completeness that would meet Bouwkamp's stringent standards. 
Let me all the same state the problem. iP noh et 

In 1928 Gamow published his theory of «-decay 5, The essential idea is that 

a-particles are escaping from a nucleus by tunnelling through a potential barrier. 
Gamow treated the problem by introducing the notion of complex eigenvalues. 
They correspond to wave functions that are outgoing waves for large r, are 
exponential in the region where E— V < 0, and are toa high’ degree of approx- 
imation standing waves inside the barrier. 

To my knowledge such complex eigenvalues have not been used very exten- 
sively, and I am not aware — but this may be. my: lack of familiarity with the 
relevant literature — of the existence of a body of general theory describing the 
relation of such eigenstates to.the complete set of orthogonal eigenfunctions. 
But it struck me that the concept, of complex: eigenvalues might conveniently 
be used to calculate electromagnetic modes in certain arrangements in gas lasers. 

Let us consider the following simple arrangement: The space between two 
perfectly conducting mirrors at z = 0 and z=Lis subdivided in three regions: 


(D x<-d;-- Ш афа (Ш) x >d. 


In (Т) and (III) the dilecte: constant is € > 1; in (II) it is unity. 
' We look for ап electromagnetic wave of the following form: 
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in (I) and (III) 


E, = B sin (koz) exp (—ix|x|) exp (ior), (1) 
in (П) 
E, = A sin (Koz) cos (kx) exp (iot). Q) 


These expressions are solutions of Maxwell's equations, if 
ky? +x? = е w/c’, (3) 
ko? + ke? = с. @ 


The boundary condition at x = d is 


dE, dE, : 
— JE; =|{—/£,]}.- (5) 
ox ш ox и 
The boundary condition at x = —d is then automatically fulfilled. 
Applying (5) to the expressions (1) and (2) gives 


x 


ik 
cot (k,d) = ——. 
x 


Together with (3) and (4) this is sufficient to determine ky, x and w for given ko. 
We shall now make a simple first-order approximation. Suppose the wave, 
with А < d, is essentially running parallel to the z-axis. Then k, « х. We write 


хо == (є = 1)? ko and Wo = с Ко. 
For k, we can now write 
К. = k£ + 6k,, k,°=(s+4)2/d мр5 = 0, 1, 2,.... 


It follows 


ik, 
ôk, d = ; 
хо 
i (ky 
й (5+ 9) л xo 


and this gives an imaginary contribution to о: 
і (uy 1 
до — — Wg. 


сарая k E-P" 


Since k,° < ko the damping is very small indeed and the condition E, = 0 for 


LASER MODES; AN IMPERFECT TRIBUTE ТО С. J. BOUWKAMP 3* 


x = + dis a very good approximation. Physically this means that the waves 
inside the slit between the dielectrics (that is in region (II)) hit the dielectrics at 
grazing incidence, so that reflectivity is high. Only little energy escapes, of 
course in a direction very close to the angle of total reflection. 

Now it is quite possible to extend this type of calculation to a cylindrical 
arrangement, where e = 1 forr < R and e >1forr > R. Such an arrangement 
is close to the actual construction of early helium-neon lasers built in the Philips 
Laboratories. There are as far as I can see no particular difficulties, neither in 
formulating the general equations nor in finding simple approximations similar 
to those I described above. But the calculations remain to be done. 

And here we meet a striking difference between Bouwkamp and myself: if 
Bouwkamp tackles a problem he deals with it exhaustively, he “exterminates” 
the problem, to use his own words. (Of course he will have to admit that some 
problems are like a cat with nine lives.) 

Bouwkamp and I wrote one paper together; it dealt with multipole radiation. 
Again, our approach was different. I would distrust my own calculations (and 
rightly so) unless I knew beforehand from general considerations what the 
answer would have to look like. Bouwkamp, on the other hand, was very 
sceptical of such general, mainly group-theoretical, arguments until he had 
verified them by explicit calculations. I think the result was a rather useful 
paper. 

Bouwkamp’s published work deals mainly with Maxwell’s theory. He is not 
primarily concerned with the electromagnetic properties of matter: often they 
are described simply by constants e, и and о. But within that restriction his 
work covers a great variety of problems: antennae, diffraction, waveguides, etc. 
It shows the power of explicit mathematics, applied with consummate skill, 
thoroughness and accuracy. His work on diffraction through holes, with very 
complete evaluations at all distances, and his fundamental work on supergain 
antennae (with N. G. de Bruijn) should be specifically mentioned. 

There is another, more playful, aspect of Bouwkamp's work. He has been 
studying magic squares and a variety of mathematical games and puzzles. For 
such work he has made extensive use of computers, and the novel types of 
programming that were required interested him probably as much — if not 
more — than the puzzles themselves. 

Finally there is an aspect of Bouwkamp's activities that is not at once evident 
from published literature. It is his role as an adviser. During his years with 
Philips innumerable people — myself included — walked into his office with 
some mathematical problem and went away with very sound advice how to 
tackle the question, or — if it was a really tough nut to crack — with a promise 
that Bouwkamp would look into the matter. In this way he made a most valuable 
contribution to the work of the Philips Laboratories; as a one time director I 
should like to express my admiration and my gratitude. 
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And if I have not done my homework, if I have not finished the problem with 
which I began this note, it is certainly not for lack of advice, nor even for lack 
of prodding, by Bouwkamp. P x 


; REFERENCE 
1). G. Gamow, Z. f. Phys. 51, 204-212, 1928. 
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A LESSON IN ELEMENTARY GEOMETRY 
by O. BOTTEMA 


Professor Emeritus, Delft University of Technology 
Delft, The Netherlands 


(Received May 9, 1974) 


1. Introduction 


‘When a man celebrates his sixtieth anniversary the number of those who were 
adult when he was a boy must be rather restricted. I saw Bouwkamp for the first 
time in 1931; we attended then the same secondary school, be it in different not 
to say opposite capacities. Obviously our first meetings impressed me very much, 
for whenever I came across him at irregular intervals during the next forty-five 
years, at the first flashing moment of the encounter he was sitting in my imagina- 
tion on the first bench of the classroom, in the left-hand row as seen from the 
teacher’s position. And though growing older and rising continually in scientific 
eminence he did not change essentially, thus illustrating the poet’s aphorism that 
the child is father to the man. He was at the time a sturdy youngster, cheerful and 
good-natured, active and unsophisticated, with a passion for exact knowledge 
and the necessary healthy ambition to solve mathematical problems, but also 
with a patient and understanding readiness to assist less-gifted school fellows. 
As one of his teachers at that time I am still unreasonably proud of having 
helped him to discover some notions of simple algebra and trigonometry, which 
may have been the origin of Bouwkamp’s profound knowledge of mathematical 
analysis, the tool he needed to formulate the laws of nature and to reveal its 
secrets. Therefore, let history repeat itself for once: I hope that my learned friend 
is willing to accept as a birthday present from his former schoolmaster a problem 
of elementary geometry, to remind him of the old days and as a token of my 
respect and my gratitude. 


2. The problem 


Our elementary problem reads as follows: the angles of a plane quadrangle 
are given; determine the sides. Let 4 = (4,424344) be the quadrangle, 
LA: = а 444, = х, 4.42 = х, A243 = хз, A344, = x4 (fig. 1). The 
angles о, with Ў о; = 27, are known, which means that A must satisfy three 
independent conditions. As a quadrangle is determined by five data we may 
expect a set of оо? solutions. But if (хс) is a solution and k an arbitrary non- 
zero constant obviously (k x;) is a solution as well. 


О.ВОТТЕМА 


For fixed x, and variable k the quadrangles (k x;) are all similar. We shall 
consider such a class as one solution of the problem. Hence there will be со! 
essentially different quadrangles with. the given angles. 

Let us suppose first that no angle is more than л. Аз У a, = 2л there are 
always two adjacent angles the sum of which is equal to or more than л. 
Without any loss of generality we may suppose that these are «, and ад. Let 
A, and A, (fig. 1) and therefore x, be fixed. Hence 4,4; and 444; are either 
parallel or they have their intersection S to the left of 4,44. If о 27 — 01 
the vertex A, appears on SA4, for any xz, to the right of 44; for à; < л — 0, 
the same holds if x; is large enough. If one angle, say аз, is more than л, we 
take x4 and x, to be arbitrary (fig. 2) and complete the construction which 
shows that A; is inside the triangle 414244. 

The important conclusion is: there are always quadrangles A with prescribed 
angles оң; moreover, of those constructed so far the perimeters have no double 
points. 

We shall now determine the sides x;. In fig. 1 we have 


Е LS = л 0 — аз = —л | 0, + 04 
and іп fig. 2 
Z ASA; = m — à, — 04 = л + 42 + 05. 


The oriented track A,A,A3A, is projected on an axis perpendicular to 4,4; 
and we obtain (in both cases) the relation 


U, = x, sin a, + x4 sin (0 + аз) — xs sin a, = 0. (1) 
Similarly, projecting 444,4243 on an axis perpendicular to 4344, we have 
U, = x, sin о + x2 sin (a, + a3) — x3 sin o = 0. (2) 


(1) and (2) are two independent equations for the unknowns xj; they are 
homogeneous linear equations and as we are only interested in the ratios of x, 
our problem is essentially solved and we see moreover that these ratios will be 
linear functions of one parameter. 
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3. A mapping 


In order to visualize the subject we make use of the following representation. 
A class of mutually similar quadrangles is mapped onto the point, in three- 
dimensional projective space, with the homogeneous coordinates (xj). U, and 
0 represent two planes in the image space; hence the solutions of our problem 
are mapped onto points of their line of intersection 1. 

The question arises whether any point on / represents a solution. This will 
certainly not be the case if we restrict ourselves to those quadrangles (from now 
on to be called elementary quadrangles) the sides of which all have the same 
sign. We shall generalize the concept of a quadrangle in a natural way, accepting 
a specimen whose perimeters have a double point and introducing negative 
sides. 

Let А and A’ be two non-similar quadrangles with the same angles. By 
changing the scale of A’ in a suitable way we can displace it so that A,’ coincides 
with A, and A,’ with A,; then fig. 3 appears, with 4,’A;’ parallel to 4,43. 
If Аз' moves along 4344 and A,’ along 4,4; in such a way that 4,'43' remains 
parallel to 4743 the new quadrangle is still a solution. If A,’ arrives at A, the 
quadrangle is degenerated, with x, = 0 and /4,4,4; —z-—a,— 9 
(fig. 4). Further situations are given in fig. 5 (with x, < 0) and in fig. 6 (with 
X4 « 0, x; « 0). Taking the signs into account the relations (1) and (2) are 
still valid. It follows from all this that with x, fixed, x4 can have any real value. 
The conclusion is: any point of the image line / represents a solution of the 
problem. The set of solutions is mapped onto the line 1. 

The set contains four degenerated quadrangles, which are in general distinct; 
they correspond to the intersections of / and the coordinate planes. That for 
which x4 — 0 follows from (1) and (2) but more easily from fig. 4 and is seen 
to be 


X, = sin 02, x, = sin (a, + 9), хз = sin оү, x, = 0; (3) 
the three others follow by cyclic permutation. As / is determined by two of its 


points the set of solutions of our problem may be given (be it in a non-sym- 
metrical way) by 


хі = sina, + А ѕіп (о; + o4), x2 = sin (a, + a) + Asin ош, 
хз = SİN 01, ха = Asin 01, (4) 
A being a parameter. Any set of four angles о; with 2: о; = 2л corresponds 
to a line /. Conversely a line / corresponds in general to one set of angles. 


There is however a singular line /, corresponding to all sets with 


ay = 03 = P, 905 = 0 = л — Ф; 


Fig. 5. Fig. 6. 


1, is given by 
ху — хз = X2 — Ха = 0; 


its points are the maps of parallelograms. 

The four given angles are equivalent to three independent data. Hence there 
are oo? image lines /. The space, however, contains со® lines. The conclusion 
is: the set of image lines 1 belongs to a line complex, which we denote by C. 
The Plücker coordinates ру of / are the minors of a matrix built from the 
coordinates of two of its points, e.g. 


sin 02 sin (à, + 2) sin 0; 0 
(5) 
sin (a, + a4) sin &4 0 sin &, 
Making use of the relation 
sin (о, + a4) sin (a, + &) = sin æ sin &4 — sin «, sin as, (6) 
which may be proved as a consequence of 2: о; = 2л, we obtain 
Pia = Sino, Pog = Sin (0; + 92), рза = sin o, (7) 
Раз = SİN од, Рз: = sin(o, + 04), — pia = Sin 03, 


which is a representation of the complex С by means of the (essentially three) 
parameters оц. 
Any line / belongs-to C; we shall show, however, in sec. 5 that the converse 
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is not true: not any line of C is a line /. Through a fixed point X(x,) of the 
image space pass oo! lines of C, the generators of a cone with vertex X. Sup- 
pose that x, are all positive and. any of them less than the sum of the other 
three. Then X is the map of a movable four-bar and any generator represents 
a set of its possible angles. The motion of this mechanism however (it is the 
dual problem of that we are dealing with) is a complicated affair and after 
having been studied for more than a century it has not yet revealed all its 
secrets. Therefore we do not continue the study of C or of the set of lines / 
on it here. 

For'an elementary quadrangle any side is less than the sum of the other 
three. For instance one has always —x, + х, + хз + x4 > 0. Obviously this 
inequality makes no sense for a general quadrangle, for we are allowed to 
multiply all x, by the same negative number. An elementary quadrangle with 
sides x, exists if and only if 


H = (х, + x. + хз + x4) 2 0. (8) 


For a generalized quadrangle this inequality is not senseless, but obviously it 
is not true. Indeed, we have Н = 0 for four values of 1, corresponding to the 
intersections of / with the planes V, = 0, V, = —x, + x; + хз + Xa etc., 
which in general are different. Hence in our set there are quadrangles with 

- H >0, with H = 0 and with Н < 0. There is an inequality, however, involving 
the sides and the sum of two opposite angles, which is valid for any quadrangle, 
elementary or generalized. If F stands for the area of an elementary quadrangle 
the following formula holds: 


16 F? = Н — 16 x, х, хз x4 cos? 1 (a, + оз). (9) 


The proof, to quote from Johnson's classical treatise +) involves long and 
rather unpleasant trigonometric reductions, but for us it has the advantage of 
being valid for generalized quadrangles as well. Hence the inequality 


H — 16 x, х» хз x4 cos? L(a, + аз) > 0. (10) 


This may also be checked by substituting (4) into the left-hand side, which gives 
us, again after much algebra, 


4sin? о, [A? sin a sin (à, + &4) + 2 Asin w sin ош + sin а, sin (о; + @„)]?. (11) 


If all x; have the same sign (or more general: if x, x. хз x4 > 0) eq. (10) 
implies Н > 0. We shall make use of eq. (10) in sec. 5. 
4. Special quadrangles 


As there are oo! quadrangles with given angles we may ask for one more 
condition to be satisfied. - 
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We give some examples. The quadrangle of the set for which V, = 0, was 
already mentioned. The condition is linear. 

When we substitute (4) into x, + x2 + x4 — x4 = 0, A is found and the 
quadrangle is seen to be 


x, = cos $a, sin ja, cos 2(0; + @), х2 = sin Таз sin За, sin (0, + &2), 


12 
хз = cos 3a, sinda, cos 3(a, + 94), ха = costa, cos $a; sin (a, + a). x 
If two opposite sides, x, and хз say, should be equal the condition is again linear; 
we must intersect / with the plane x, — x3 = 0. By means of (4) and making 
use of (6) we obtain 


x, = хз = sin(a, +4), X2 = —5іп аз + Sin од, x4 = sin «, —sin a. (13) 


We know that in our set there are always elementary quadrangles, for which 
(if they are convex) the condition to be a circumscribed quadrangle (that is a 
quadrangle the sides of which are tangents of a circle) reads 


ху + Хз = X2 + Xa 
once more a linear condition. The solution reads 


x, = sin фо, sin $æ; sin 3(a; + аз), x2 = sin os sin $a sin (0з + o4), 


хз = sin Фо sin $a; sin 3(04 + à,), x4 = sin 3a, sin 3o sin 3(o, + 02). 


(14) 


As all x, are positive the quadrangle is indeed elementary. 
One may ask whether our set of quadrangles with given angles contains any 
with zero area. As 


2F = X2 X3 sin [Л + Ха X1 sin has 
the condition F = 0 is quadratic and leads by means of (4) to the equation 
А2 sin 4 sin (a, + ош) + Asin o; sin &4 + sina, sin (0, 1-42) = 0, (15) 


which also follows from (11). Making use of (6) the discriminant of (15) is seen 
to be 
D = sin & sin 0, sin a3 sin &4. (16) 


Hence the theorem: there are two different quadrangles with given angles о; and 
zero area if and only if any angle is less than x. If this condition holds we find 
the sides by solving (15) and substituting А into (4). 

The answers will not be given here but we mention the elegant relations 


X, 2X3 = + (sin a; sin a5)? : (sin a, sin o,)!2, a7) 


X2 1X4 = F (sin аз sin ош)!/? : (sin a; sin @„)!?. 
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5. Cyclic quadrangles 


Until now we have supposed the four angles to satisfy Ў о; = 2x but to 
be furthermore arbitrary. We consider now the case that they are specified by 
the relation а, + аз = л, which implies « + а, = л. This means that we 
deal with cyclic or inscribed quadrangles: the four vertices are on a circle, 
the sides are chords of this circle (Dutch: “koordenvierhoek”). We shall see 
that this restriction gives a considerable simplification of the problem. 

From (4) it follows that the sides of cyclic quadrangles with given angles a, 
and а, are 

хі = sin a, — Å sin (0, — а), x; =sin(a, + аз) + Asin Oo, is 

Хз = Sin 0, Ха = Asin a. qe 

Any such set is mapped onto a special line / of the image space, which we 

denote by /’. The lines /’ depend on two parameters х; and a, and their locus 

is therefore a variety of oo? lines. The Plücker coordinates of a line I’ follow 
from (7) if we substitute «з = л — «,, «4 = л — a; we obtain 


Pia = singz, pz4= sin (0; + oj), Рза = Sin @,, (19) 
Роз = —Sin&,, ps, =—Sin (о; + a), рү; = sin Qi. ` 
These coordinates satisfy the equations 
ріг — рза = 0, pos + Pig = 0, (20) 


and the conclusion is: the lines 1' belong to a linear congruence, which we denote 
by L and which is of course a variety on the complex C. A linear congruence 
is the system of lines intersecting two fixed skew lines. It is easy to verify that 
for L these lines, m,, т», are: 


My: XQ— Хз = Хз -+%,=0; тә: xy + x3 = X, — x4 = 0. (21) 


They are real, L is a hyperbolic congruence. From (18) it follows that /’ inter- 
sects m, at M, such that we have 


Mı: xı = x3= cosha,—a2), х= —X4 = cos (0; + а), Q2) 
Mi: x; == —x3 = —sind(m — 2), х = х = sin 39, + a2). 


Hence the sides of the cyclic quadrangles with given angles «ү and &, are 


xı = A, cos (0; — а) — A, sin 39, — a2), 
хь = A, cos (o, + а) + A, sin о; + әз), (23) 
Хз = 4, cos (a, — o5) + 22 sin (0 — o), 


ха = —À, cos Y(a, + аз) + 22 sin (0, + о). 


For the sake of symmetry we have introduced two parameters 4, and A,. 
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If they vary independently (23) gives us co? cyclic quadrangles; not only all 
shapes but also any size is represented. 

From (21) we conclude that m, is the intersection of the planes V; and V; 
introduced in sec. 3, and m; is that of V, and V4. The four planes V, are the 
faces of a tetrahedron B,B,B3B,, with B, = (—1, 1, 1, 1), etc.; m, is the line 
B,B, and m; is B,B3. As I’ intersects m, and т», its four intersections with V; 
(different in the general case) are now coinciding in pairs. As for a cyclic 
quadrangle «, + аз = л the fundamental inequality (10) reads H > 0 and 
this may be satisfied for any point of /' by the circumstance that this line 
intersects V, and V; and again V; and V, at the same point. The configuration 
answers the question whether any line of L is a line /' and, as we shall see, in 
the negative. Indeed, if /' intersects m, at M, = (a, b,, a1, —b,) and mz at 
М, = (a, b2, —45, b2) for the point M, + А M; we obtain 


H = 2* 2° (a,? — b,?) (aq? + 5,7). (24) 


Hence from Н > 0 we conclude that a,? > b,? implies b3? > a,” and that 
a,? < b,? implies b,? < a;?. The situation in the image space is therefore 
as follows: Вз and B, divide m, into two disjoint intervals 7, and 7,5, say, 
and B, and B, divide m, into Г; and 7;;. Only those lines of L which join a 
point of 7,, to a point of one of the intervals Г; are lines I’ and so are those 
joining a point of 7,; to a point on the other interval on mz. Hence the linear 
congruence L consists of two subsets, L, and L3; the lines of only one of them, 
L, say, correspond to real cyclic quadrangles with prescribed angles. 'This implies, 
as we already announced in sec. 3, that not all lines of the complex C are 
lines /. 

Through any point Х(х;) of the image space, not on m, or mz, passes one 
line of L; if H(X) >0 it belongs to L,, if H(X) < 0 it is a line of L,. Hence 
the theorem: if x, are given lengths (any of them may be positive or negative) 
there exists a (unique) cyclic quadrangle with sides x, if and only if A(X) z 0. 
This is a generalization of a well-known theorem in elementary geometry: a 
cyclic quadrangle with given sides exists if and only if any side is less than the 
sum of the other three. 


6. A final remark 


We have determined the sides x, of quadrangles with given angles by means 
of the linear equations (1) and (2). An alternative method, not recommendable 
to obtain the solution, but with some interesting features, would be to make 
use of the two relations 
F, = x12 + x4*.— 2x, X2 cos a, — (X3? + x4? — 2 X3 x4 COS o3) = 0, Q3) 
Е, = X2? + х32.—2 х› x4 COS аз — (x4? + x1? — 2 x4 x1 cos o4) = 0, 


which follow immediately from the cosine law. The solutions are now mapped 
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on the intersection Г of the two quadrics F,. A further investigation shows, 
as could be expected, that Г is degenerated; one component is a straight line 
(our line / as a matter of fact) and the other one is a non-degenerated twisted 
cubic сз in the general case; for cyclic quadrangles сз is degenerated into three 
lines. The twisted cubic has two points in common with /, real or imaginary 
(they correspond to quadrangles of zero area by the way). The other points 
of c do not correspond to solutions; they have slipped in because of the 
imperfection of the method. 
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It is with the greatest pleasure that I present this article in honor of Dr C. J. 
Bouwkamp on the occasion of his sixtieth birthday. Clearly, his scientific con- 
tributions have been eminent. They have served us for many years and un- 
doubtedly will continue to do so for many more. Along with his other admirers, 
I wish Dr Bouwkamp a very happy birthday and many happy returns. 


1. Introduction 


One of the interesting problems of classical electromagnetic theory is the 
problem of finding the equation of motion for a classical charged particle in 
an electromagnetic field. What makes the problem challenging is the fact that 
the equation of motion is one of the axioms of the theory and hence cannot 
be derived. The best that one can do to arrive at an equation of motion is to 
construct a likely equation by guesswork or physical intuition and then test its 
validity by experiment. 

In cases where radiation reaction is negligibly small, no question arises because 
in such cases the Lorentz equation of motion +) is sufficient for calculating 
the motion of a charged particle. On the other hand, when radiation reaction 
is not negligible, the Lorentz-Dirac equation of motion ?) is used, which takes 
into account radiation reaction but does so in a way that introduces conceptual 
difficulties. Yet, despite these difficulties, the Lorentz-Dirac equation is accepted 
as the standard equation of motion on the ground that no improvement is 
possible or necessary. 

Recently, however, the problem was examined again and a new equation of 
motion $) was proposed. It is this new equation of motion that we shall briefly 
discuss here. 


2. The Lorentz-Dirac equation 
We recall that the Lorentz-Dirac equation of motion is relativistic and in- 
cludes radiation reaction. In vector form (MKS units) it can be written as 
d M 


mad-pyes о = e (E + vx B), (1) 
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е? 


pe = 
6 л £o c? 


Y dg dB\27 d 1 d v 
(6 3] (в <a) dace alt 


where @ = v/c. On the left-hand side of (1) the first term is Newton’s inertial 
force and the second term F'*? is the radiation reaction force. The term on the 
right-hand side is the Lorentz force. We note that when the term Е" is dropped 
the equation reduces to the Lorentz equation of motion. By splitting the reaction- 
force term we can write the equation in the following form: 


d Y е? Y dp \? dv]. 
"RU Py? блг ар) (5) (8а d 3] |- 


g e? d 1 d voo j 
TPE D e Ac Ву? dt i-um) @) 
As before, the first term on the left is Newton’s inertial force and the first term 
on the right is the Lorentz force. We recognize the second term on the left as 
the Larmor (or actually the Liénard) radiation-force term, and the second term 
on the right as the Schott term. 

The Lorentz-Dirac equation has certain inherent difficulties. First, the equa- 
tion involves the derivative of the acceleration (Schott term) and hence needs 
one extra condition in addition to the usual Newtonian initial conditions to 
determine the motion of the particle. Second, it yields runaway solutions which 
can be avoided only by introducing pre-acceleration. Third, in some situations 
it implies that the external energy supplied to the particle goes into kinetic 
energy only and accordingly radiation energy is created from an acceleration 
energy which is negative and unphysical. 

These difficulties cast some doubt on the validity of the Lorentz-Dirac equa- 
tion and encourage us to consider the new equation of motion given below. . 


3. The new equation 


We shall not go into the reasoning that led to the new equation of motion. 
Instead, we invite the interested reader to study the original papers on the 
subject ?). In vector form the new equation is 


d Y e? Y dg dg? 
с ы ср - (5) ]- 
dt (1— (1 82)172 бл £ c? (1 — В?) dt 


2, = e? ps 1 d v " 
EROSION sl аа тат) | 
(3) 
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As in the Dirac equation (2), we have on the left-hand side Newton's inertial 
force and Larmor's radiation force which now by itself is the radiation reaction. 
On the right-hand side we have the Lorentz force and a new force. The Lorentz 
force plus the new force constitutes the force that drives the particle. 

In the papers ?) referred to above the implications of the Dirac equation are 
compared and contrasted with those of the new equation. For certain basic 
physical situations (test cases) let us now review what these implications are: 


(a) Zero external field (E — 0, B — 0). When the external field is zero the 
new equation directly yields the reasonable solution v — constant whereas 
the Dirac equation requires that this solution be “physically” singled out from 
an infinity of runaway solutions. 


(b) Constant uniform electric field (E — constant and uniform, B — 0, par- 
ticle assumed to have initial velocity parallel to the electric field). Under these 
conditions the new equation yields 


eEs 
vy — ctanh (s. + —) 5 >0, 
тс 


where the initial velocity is some constant c tanh K,. We note that s is here 
the proper time and is related to the laboratory time / by ds = (1 — 52)! dt. 
The Dirac equation yields 


eEs 
vp — ctanh (s. + К, exp (s/t) + n, 
тс 


where т = е2/блєот. The term K, exp (s/t) is a runaway term and must be 
eliminated. This can be done by imposing an extra physical requirement, e.g. 
vp = constant when E = 0. From the above solution for vy we see that in 
the new equation the inertial force equals the Lorentz force and the radiation 
force equals the new force. Consequently, one can think of the Lorentz force 
as the force that is supplying kinetic energy, and the new force as the force that 
is supplying the radiation. According to the Dirac equation, however, the radia- 
tion is supplied by the self-acceleration-energy Schott term, and the radiation 
reaction, Е" of eq. (1), is zero. The Dirac equation gives rise to the paradox 
that the particle radiates but experiences no radiation reaction! The new equa- 
tion is not burdened with any such. puzzle. 


(c) Incident rectangular pulse (E = constant for 0< s< sı, E = 0 for 
s< 0 and s > %;, and B = 0 for all 5). When such an electrical pulse hits 
the particle we see from the new equation that 
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ctanh K, for 5<0, 
m= camay fo O0-«s«s, 
c tanh ф for 525, 
where 


eEs eEs, 
= — +K, and ф = + Ky. 
тс тс 


Using the requirement that the acceleration be zero when s is large we see 
that the Dirac equation yields 


etanho for 0< S< S 


ctanha for s< 0, 
Up = | 
ctanhó for ss, 


where 


eEc 54 $ 
«=K + [i-em -=)| exp- 
mc T T 


еЕт 5—81 eEs 
o= K+] 1 exp +——. 
тс т тс 


апа 


From the expressions for vy we see that the particle does not move until the 
pulse hits it. However, from the expressions for vp we see that the particle 
starts to move before the pulse hits, i.e. the particle experiences pre-accelera- 
tion. 


(d) Constant uniform magnetic field (B = constant and uniform, E = 0, 
particle assumed to have initial velocity perpendicular to the magnetic field). 
For this case exact solutions of the Dirac and new equations of motion have 
not yet been found. However, useful information can be gained by considering 
the actual orbit as a circular orbit perturbed by correctional forces. If the par- 
ticle’s motion were governed solely by the Lorentz equation of motion, the 
particle, of course, would move with a velocity v, along a circle of radius гу, 
its speed (v, . v,)!/? being constant. In reality the particle does not travel in 
a circle, but spirals inward. Calculation shows that the spiral predicted by the 
new equation is tighter than the spiral predicted by the Dirac equation. The 
first-order correctional forces for the Dirac and new equations are the same 
(we denote each by Е,). Likewise, the Dirac and new equations yield identical 
differential equations for the second-order velocity v4. On the other hand, the 
second-order correctional forces differ from each other (we denote the one for 
Dirac’s equation by F;, and the one for the new equation by F,,). The dif- 
ference of the second-order correctional forces 
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AF, = F, D F, N 
and the first-order correctional force F, are compared to the main force 
| |^ d Yi 
т — ——— 
аг 1 — 5^ 
in table I. In this table r, is the classical radius of the charged particle and 


2 


e mv, 


fe =, 
ев) 


=т=т ae a r 1 
42 & mc? 

The deviation of the two spirals depends on AF). In a typical electron synchro- 

tron this deviation would be too small to detect. However, for a highly energetic 

charged particle in a strong magnetic field, as in astrophysical applications, the 

deviation could be large. . 


TABLE I 


(1 — В2)° 
bt re 2 ғ, 
(1 — 6)? (=) (3) 


(e) Coulomb field (B — 0, E — electric field of a fixed point charge q). A 
charged particle that in the absence of radiation reaction would move in a circle 
about the fixed charge spirals inward toward the centre. In this case, contrary 
to the previous case, the spiral of the Dirac equation is tighter than the spiral 
of the new equation. 


GE e 
1 ву? 


From Ње above examples we learn that the new equation of motion (3) gives 
results which are reasonable and in (all?) laboratory situations hardly distin- 
guishable from the results of the Dirac equation (1). However, for highly energetic 
charged particles in very strong magnetic fields (as in some astrophysical situa- . 
tions) the difference between the results (e.g. radiation rates and radiation pat- 
terns) could be large and surprising. 


4. Conclusions and comments 


We propose that a particle of mass т and charge e in an electromagnetic field 
(E, B) should move in accord with the new equation of motion ?) and not the 
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standard Dirac (or Lorentz-Dirac) equation of motion 1). The fact that the 
new equation of motion is free from conceptual difficulties, is intuitively pleas- 
ing and yields reasonable results in several test cases, supports our confidence 
in its validity. 

It must be mentioned that an action integral for the new equation has not 
yet been found. Such an action integral would be very useful, but our not 
knowing what it is does not disprove the equation. Indeed, even for the Dirac 
equation, which has been with us for almost thirty-five years, the action integral 
seems to be open to question. 

An experimental check is required for the new equation. For this we must 
appeal to the astrophysicists because it is they who are concerned with particles 
of sufficient energy and fields of sufficient strength to place in evidence the 
difference between the new equation and the Dirac equation. 
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Suppose that there are п football teams and that an expert is asked to compare 
them by stating which one of a given pair (i,j) ( AJ, Lj = 1,2,..., n) is 
better. 

Having made (7) such comparisons, the expert may find that he has been 
inconsistent by having stated that i is better than j, j better than К, but that 
also К is better than i. 

The question is what is the maximum number of such inconsistencies. 

A. set of (7) comparisons can be recorded in the form of a matrix А = ((a,j)) 
where 


a 1 if i has been claimed to be better than j, 
VU ^ |0 otherwise. 


It is now clear that the number of inconsistencies Г, is given by the simple 
formula 


I, = 4 Trace (43). 
Let us now write 

А = (5+ Т) 

where 
Sy = 1— ду 
(6,; being the Kronecker delta) and 
у= +1 
satisfying the antisymmetry relation 
fj = ba 

It is clear that 


8 Trace (43) = Trace (53) + 3 Trace (S? T) + 3 Trace (5 T?) -- Trace (T?) 
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and that 


Trace (7?) = Trace (S? T) = 0. 
It is also quite easy to obtain that 


Trace (53) = n? — 3и? + 3n —n 
and a little more difficult that 


n n 2 
Trace (S T?) = — ( У 2 +n? — п. 
=1 


k і=1 


Finally, we obtain the formula 


and a little thought makes it clear that if п is odd it is possible to choose the 
t’s in such a way that 


n 
Уш =0 forallk 


і=1 


and that therefore (for и odd) 


n (n? — 1) 


24 


Max 1, — 


For n even one can choose the ?'s so that 


> f — Ll 
і=1 
for all К and hence (for и even) 
n(n? —4 
Max J, = жыйы, ; 
24 


It is not too difficult to show that if comparisons are made at random the 
average number of inconsistencies is 


dn) = — _ с 
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There is a moral to the story which Professor Bouwkamp, to whom this little 
exercise is dedicated, might appreciate. 

There is a tendency in contemporary mathematics to insist on using its objects 
in their “natural” setting. Since matrices are “merely” representations of linear 
‚ transformations, their natural habitat is linear algebra, and it would be border- 
ing on sacrilege to use them in a different context. 

But pragmatists like Professor Bouwkamp and the author tend not to take 
esthetic niceties that seriously and use the “catch as catch can” approach to 
solving problems. 

After all, why not? 
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1. The Fourier transform of fe L (— oo, oo), 


со 


Јо) = | окр бху) 70) 4 хед, 0) 


-0 


is a bounded continuous function. If also à eL (— oo, oo) then by a simple 
application of Fubini's theorem 


Јо) 50) а = | уд фо) ах, 


and this may be written as 
<h Ф) -Q à). Q) 


Most extensions of the Fourier transformation to generalized functions are 
based on (2), the best-known extension being that to tempered distributions. 
If фє5 then also фє$, and for fixed fe 5', (2) holding for all ф eS defines 
feS'. More generally, if Ф is a space of testing functions which is mapped 
continuously by the Fourier transformation into another space of testing 
functions Ў, then the adjoint map defines by means of (2) the Fourier trans- 
form of a generalized function fe У”. 

Beside this *method of adjoints" for generalized integral transforms there 
is also a "direct method" which has been used extensively in connection with 
other integral transforms ?) but so far has not been much used for Fourier 
transforms. This method depends on interpreting (1) as (f, e,», where 
ex(y) = exp (ixy), and finding a space of testing functions which contains e, 
for all real x. The only one of the commonly used testing-function spaces for 
distributions which has this property is E and accordingly one may define for 
distributions with compact support a numerical-valued Fourier transform 
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Fœ) =f 6 XER. (3) 


It can be proved (ref. 2, sec. 5.2.3) that F is a continuous (actually, entire) 
function of polynomial growth on the real axis, and that F generates fes’ 
defined by (2). We shall be concerned with the extension of all this to larger 
classes of generalized functions. 

The testing-function space which recommends itself in this connection is B, 
the space of smooth bounded functions with bounded derivatives and with the 
topology of uniform convergence on R of functions and derivatives. For 
ГеВ’, eq. (3) defines a numerical-valued Fourier transform of polynomial 
growth, and many features of the L'-theory of Fourier transforms including 
the basic formulae and the convolution theorem can be reproduced. However, 
D is not dense in B, B' cannot be identified uniquely with a subspace of dis- 
tributions, and certain difficulties arise. 

This disadvantage of B is sometimes removed by considering the subspace B 
of those à e B which vanish together with their derivatives at infinity. D and 
S are dense in B, and (B)' can be identified uniquely with a subspace of distri- 
butions. However, B cannot be used for our purpose since it does not con- 
tain ey. 

Instead, we shall restrict ourselves to those fe В’ which remain continuous 
if in the topology of B, “uniform convergence on R” is replaced by “bounded 
convergence on R and uniform convergence on each compact subset of R”. 
For these the generalization of the L'-theory can be completed. The Fourier 
transform is now a continuous function which generates / of (2), and an inver- 
sion formula can be proved. 

We shall follow as far as possible the notations and conventions, and will 
use some of the results, of Zemanian’s book on generalized integral transforms. 


2. For фе С°(К) and ke № we set 


Bb) = sup {|#%()| : x e К}, 
УФ) = max {Bo(4), В.(Ф), . . -> BC) 


The В, are seminorms, f, and the у, are norms and the collection of smooth 
(i.e., infinitely differentiable) complex-valued functions on R endowed with the 
topology generated by the В, (or, equivalently, by the yx) is a complete countably 
multinormed space B (ref. 3, pp. 39, 41). Convergence in B is uniform con- 
vergence on R of the functions and all their derivatives. B is a testing-function 
space: in the usual notation of testing-function spaces for distributions, 
DcSc Bc E. Neither D nor S is dense in B (ref. 3, p. 41), indeed the clo- 
sure of D or S in B is B, the subspace of all those $ e B satisfying $?(x) — 0 
as |x| — 00, 

The elements of B’, the dual of B, are generalized functions. fe L'(R) 


(4) 
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generates an element of B’ according to 


<f Ф) = SI Фо) dx, — $eB: (5) 


for this reason elements of B’ may be called integrable generalized functions. 
By a general theorem on duals of countably multinormed spaces (ref. 3, 
theorem 1.8-1), for each fe В’ we have an integer r and a non-negative С 
so that 


K6e|«c»2. ФВ. (6) 


The smallest integer r for which a relation of the form (6) holds is called the 
order of f, and the least C for which (6) holds when r is the order of f is called 
the modulus of f and is denoted by C,. In fact, fis a bounded linear functional 
on a certain space normed by y,, and С, is the norm of this functional. The 
functional defined by (5) is of order zero, and 


С, = f |f 9| dx 


for it. The restriction of fe B’ to D is a (tempered) distribution but, as D is 
not dense in B, f is not fully determined by this restriction. The restriction 
to B of any distribution with compact support is an integrable generalized 
function. 

For each x e R we have 


e€B, Bex) = |х|, уме) = max (L |x|*}. (7) 


It follows that for any fe B', (3) defines a numerical-valued Fourier trans- 
form which turns out to have many of the properties of Fourier transforms 
on L'(R). By (6) and (7), F(x) is of polynomial growth and in particular, 
F(x) is bounded if fe B’ is of order zero. The familiar formal relationships 
hold: 

Fx+h)=Gx) if (у) = exp (hy) fO), 

exp (Лх) Р(х) = G(x) if  g0)—f((— B, (8) 
—ixF@)=Ge) if 20)— D,fQ). 


since 
<exp (Лу) f(y), еу)» = €f (у), exp (ihy) ex)» = €f; exci», 
(f — h), e)» = </ (у), еду + D)» = €f, exp (ihx) e» 
and <D S, еу =—<f, De = (р, i X ex). 


Like L'(R), B' is a convolution algebra. For fe В’ and ge B', we can de- 
fine f» g e B' by 


<f* g, Ф) = <S), (80), 9x + 55, EB. (9) 
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To show that (9) is meaningful, consider 
6,(x) = h- (ф(х + 5) — ФО) — ф'О). 
Then 
6, (x) = AAPOR + 1) — HR) — dq) 
= pD + Oh) — ott D) 

with 0 < 0 < 1, and by a second application of the mean-value theorem 

(х) = 0hd**2(x--0,R, | 0 0,«0«1. 
It follows that 0, — 0 (B) as h — 0 and by the same argument 

KEPE + һу— GR) > Ф800) (В) аз  h- 0. 
This shows that 

yx) = «80. Ф(х + y) 


is smooth and 


y? = (g(y), à (x + y)». 


If g is of order s, we have from (6) that f(y) < C, Yk+s($) and so v eB, 
Cf, v» exists, (9) is meaningful and 
Kf $; Ф < С, С, уф), 


so that if fe В’ and g € B' are of order r and s respectively, then f» g e B' 
and is of order at most r -+ s. 
The convolution theorem 


(fx & Ex? = Cf ex) (8, €x? (10) 


now follows easily, for 


(Cf ж g)(y), exp (ixy)> = €f). <a), exp [іх (у + 2D? 
= < f(y), exp (ixy) <g(z), exp (ixz)>>. 


Since the restriction of f to S is a tempered distribution, (2) with ¢ eS de- 
fines another Fourier transform of fe 5' which is a tempered distribution. One 
would like to know if F generates f, i.e. if 


f FG) фо) dx = Cf, фу. (11) 


However, it is not at all clear that F is locally integrable. If f is an integrable 
function or a distribution derivative of an integrable function then F is con- 
tinuous and the left-hand side of (11) exists, but it is clearly untrue that for 
every fe В’ the Fourier transform is continuous and (11) holds. Zemanian 
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к E т тш кы, олы ЕН iini НЛА „=й, 


(ref. 3, p. 41) has shown the existence of fy e B^ for which 


СЉ, 9) = lim ф(х) 


whenever this limits exists. Now for $ éS we have 


| lim d(x) 20 as X — оо 
and so s 


(fo d» = 5,9) =0 —foralléeS, 


while 
F,(0) = (fo, 1) —1 
and so, if Fy is continuous, the left-hand side of (11) cannot vanish for all 


¢ € S. Nevertheless for a restricted class of integrable generalized functions we 
shall prove that F is continuous and (11) holds. 


3. With фе S, let 


з) = f exp (у) фо) dy. 


b 


Clearly, y, e C*(R): 


Bute) < | |у* $09] d» 


and so y, € B and y, — 0 (B) as b — oo. Similarly 


[9969)40)4y— 00) as а-ә, 


-%0 


and in view of (2), (11) will hold if we can prove that for fixed a, be R 


[ 70 90) & = (109, [ o @х) 40) a), а> 


For this, however, further assumptions on f appear to be necessary. 

Call a sequence ¢, in B almost convergent and write ¢, — ф(В,) as n — со 
if $, € B and for each ke N, ф„®(х) > d(x) as n— со boundedly оп R 
and uniformly on each compact subset of R. Let By’ be the collection of 
those fe В’ for which (f, $a — 0 whenever $,—-0(B,). See also ref. 1, 
Ch. VI, 8 8 and Ch. VII, $ 7. 
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Now, for fixed x e А, we have e, 4.4 — e, (Bo) as h — 0 and so, for fe By’ 
the Fourier transform defined by (3) is a continuous function of x, and both 
sides of (12) are meaningful. We shall prove that they are equal, not only for 


$ €S but for any à e C fa, D]. 
Let л be a partition of [a, b] given by 


а= уо LYZ... < уһ =b 
and 


А W-1 SH < Jv, I <r <n; 


and let 
|= = max {= v-il :1<r <n} 


be the norm of that partition. With ¢ e C [a, b], 


ал) = | exp (oy) 40) 89— У ex Cen) 00,2) 


п 


= ` J [exo 69) ФО) — exp Gem) $6191 dy 


y 
у=1 "7i 


defines 0 e B and 


v-1 


(x, л) = У, it | [exp (ixy) у“ фу) — exp (ixny) n," Ф()] dy. 
vel ? 


Since 
еә] у [qr 6001 + p dp аә 


У, 
vat "7i | 


< 2 (b— a) max {|y* 40) :a < y <b}, 


(x, л) — 0 boundedly as |x| — 0, by Riemann integration theory, and 
convergence is uniform on compact subsets of R since exp (ixy) у“ ¢(y) is а 
uniformly continuous function of y if x is restricted to a compact set. Thus, 
0 — 0 (Bo) as |z| — 0 and for fe By’ also <f, 0) — 0. It now. follows that 
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b n 
(7 (х), i exp (ixy) ф(у) ay) = lim (7 (x), У, exp (іх) Ф) О» — »-2) 


= lim Y F(q)$Q)O0.—»-12 as — [x| >, 
у=1 
but the last limit is equal to the left-hand side of (12) since F $ e C [а,Ь]. 


4. We conclude by proving an inversion formula. For fe B,’ and F(x) = < f, ej», 


1 b 
&40) = — | FO) exp (—ixy) dy (13) 
2л : 


defines a bounded complex-valued continuous function which generates a 
tempered distribution. We shall show that as a — — oo, b — со this tempered 
distribution converges to the restriction of f to S, i.e. that for each $68, 


(8.5 42 (f, d? as | a——co,b— о. (14) 


By Fubini's theorem 


b 


Г A 
Gs 4 = | (= [ 9» C99» FO) 7) Hx) dx 


ы р И—) d 
Е [ FO) day. 


Noting that (2)-+ ¢(—y) eS, (12) now shows 


ps А 
LEa,» Ф) = (7 (х), 2 | exp (ixy) Ф(—у) 77 


and by the work of sec. 3 preceding (12) 


1; x 
Go (762, - Гарбо) dcn) 
2л 


-0 


as а — — со, b — œ. But by Fourier inversion the last expression is equal to 
the right-hand side of (14). 

Every ¢ e B is the B, limit of elements of S, indeed of D. For instance, take, 
for each n e N, а, € С°(К) so that a,(x) = 1 when |x| <n and «,(x) = 0 
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when А >n+1. Then «$ — $ (Bo) as n— о. Now, let фєВ and take 
¢,€5 so that ġa — $ (Bo) as n— оо. Then (14) determines < f, $4,» and as 
fe Bo; (fo bn — Cf, d» as n — oo, so that we have 


b 


1 
(f = lim. lim e [ exe Ci») FO) ду, ко) аз) 


п-—=оа-——© 
b= со 
as the inversion formula for fe By’. If 4, is another sequence in S which 
almost converges to ¢, then ¢,— y, — 0 (By) as n — œ, so that 


Cf фә — <5 WPa? — 0, 


and the left-hand side of (15) is independent of the sequence chosen for ap- 
proximating 4. 
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Abstract 


It is physically intuitive that the dominant term in the asymptotic tem- 
perature distribution of an arbitrary body with constant thermal proper- 
ties and subject to a uniform heat flux at its surface can be gotten by 
assuming it will be the same for all points of the body. Whence, T ~ at. 
This result is shown to hold even if the surface flux varies with position. 
In addition, the rest of the quasi-steady-state solution is shown to depend 
only on position and is given by the solution of a corresponding 
Neumann problem. The results are then extended to periodic heat-flux 
conditions and to composite bodies. 


1. Introduction 


There are not too many exact solutions known for heat-conduction problems 
in which the geometry and/or boundary conditions are not simple. However, 
if one is mainly interested in "large"-time solutions, one can still, exactly, 
obtain the dominant term in the asymptotic expansion. Here, we determine 
the asymptotic temperature distribution for arbitrary homogeneous bodies and 
then for homogeneous composite bodies with constant thermal properties and 
subject to two kinds of non-constant boundary conditions. To illustrate the 
method, we first start off with the simplest of this class of problems. 


2. Here, we determine the asymptotic temperature distribution for an arbitrary 
smooth homogeneous body with constant thermal properties and which is 
subject to a space-varying heat flux over its surface (fig. 1). Consequently, 
we wish to solve the B.V.P. (boundary-value problem) 


oT 
—-—avtT (in V), а) 

or 

oT 
—k — = Q(x, y, 2) (on S), (2) 

on 
T(x, У, 2, 0) TE To(x, У, 2), (3) 


for “large” values of the time. As usual, k denotes the thermal conductivity 
and а denotes the thermal diffusivity К/со (c is the heat capacity, o the density). 
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Q (x,y,z) 


Fig. 1. 


Since the total amount of heat which has flown into the body at time ¢ is 
t f f Q dS, we expect intuitively that the leading term of the asymptotic tem- 
perature distribution will be proportional to ¢. This is indeed valid. To prove 
it, we let 


T(x, y, 2, 1) —aat ф(х, у, 2) + T(x, у, 2, 1), (4) 
where 
V? ф = Gin V), (5) 
—К of = (оп S), (6) 
on 


(a is a constant to be determined). These conditions on ¢ were set to insure 
that T also satisfies (1) subject to an insulated boundary condition and con- 
sequently 7 will approach a constant asymptotically. 

From (5) and (6), it follows by the divergence theorem that 


/] о8=к | | | 7 =кар= 05, 


where O denotes the average flux over the surface. Consequently, in order that 
(5) and (6) have a solution it is necessary that 


a= : 
kV 


(7) 


Furthermore, that the solution of ¢ is unique, up to an arbitrary constant, 
follows from the known uniqueness of the solution of V? А = 0 (in V) subject 
to då/dn = 0 (on S), aside from an arbitrary constant. To fix this arbitrary 
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constant, we also impose the condition 


f [ 0,2—6, z]] dV = 0. 
У 


This will insure that T(x, y, z, t) will approach zero as t—> оо instead of 
some non-zero constant. 

Equations (5) and (6) can be easily transformed into a classical Neumann 
problem by letting 


P(X, у, 2) = ar?/6 + у(х, у, 2), 


where r? = x? + y? + z?. Whence, 


V7 4=0 (in V), 
ox ka or? 
—k—= —— (оп $), 
on 6 d 


This Neumann problem has a unique solution for smooth SS since 


ш = 0 


by virtue of a = OS/KV. 
From the definition of ¢, it follows that (as was planned) 


T 
—=av?T (in V), 
— = (on S), 


T (x, № 2, 0) == To(x, » z) iu $(x, У, 2). 


Consequently, T is the solution of ће heat-flow problem of a body which is 
insulated and whose average initial temperature is zero. Whence Г + 0 as 
t— со. А 

We have thus established that the quasi-steady-state temperature is exactly 
given by ааг + ф(х, у, 2). For simple geometries (i.e., one-dimensional 
flow in slabs, cylinders and spheres), ф can be determined explicitly. For 
more complicated bodies, ¢ can be determined numerically. 
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3. Here, we change the problem in the preceding section by considering a 
periodic (in time) heat-flux boundary condition. Consequently, (2) is changed to 


T 
—К = = Qx, У, 2) sin (wt) + Q;(x, У, 2) cos (wt) (2’) 
п 


while (1) and (3) remain the same. Since we physically expect the temperature to 
asymptotically have a form similar to the r.h.s. of (2^), we let 
T(x, у, Z, t) = ф(х, y, 2) sin (wt) + x(x, у, 2) cos (wt) + T(x, у, z, t), (8) 
where 7 is to satisfy (1) and should approach a constant value. Whence, 
[¢ cos (wt) — x sin (wt)] w/a = sin (wt) V? ф + cos (ot) V? x (in V), 


and 


o¢ ox 
—k (sa (wt) pon + cos (wt) =) = Q, sin (wt) + О, cos (wt) (on S). 
n 


Consequently, 
а ү2 ф = —0 x, «V? 5 = 0ф (іп И), (9) 
subject to the B.C.’s 
d$ òx 
—k— =Q, —k—= Q: (on 5). (10) 
òn òn 


We first show that ф and y are unique up to arbitrary constants. Then by a 
known theorem, the uniqueness implies the existence of ф and у *). Assume 
that there exist two different solutions ¢,, y, and ¢2, %2, and form their dif- 
ferences 


Whence, 
«V*$-—ox aV?x~=o¢ (nV) an 
d$ ox 
— = — = on S). 12 
m (on S) (12) 


From (10) it follows that 
[[ [Gv $9xvinar- c. (13) 
V 


Since 
AV?AzEV.ANVA—(NVAY, 


*) I am indebted to L. Nirenberg (Courant Institute) for this argument. 
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(13) can be rewritten (using the divergence theorem) as 


ы Е 2 z E d$ dy 
MG P+D )v-[IG = + )45. 
By virtue of the B.C.’s (11), we must then have 
(V $9 + (у 3? =0 


and thus $ and x must be constants. 
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(14) 


Explicit solutions for ¢ and y can be obtained for one-dimensional flow in 
slabs, cylinders and spheres +). For more complicated geometry, we can again 


resort to numerical methods. 
It follows from (8), (11) and (12) that 


oT 
—-.vT (on V), 
or 
oT 

—k—=0 (on S), 
on 


T(x, J, 2, 0) = To(x, ys z)— x(x, » 2). 


Consequently, 


lim 7 : Ti dV z Ti tant 
д 207 Jt o— 2X) = То (constant) 


and the asymptotic temperature distribution is given by 


T(x, y, z, t) ~ To + $, у, 2) sin (wt) + у(х, у, 2) cos (wt). 


Incidentally, 


ава. | poids 
| [4 = zal {eae 


(15) 


4. Here, we generalize the problem in sec. 2 to arbitrary composite bodies, 
fig. 2. However, for simplicity it suffices to just consider a composite of two 


different materials, see fig. 3. Our B.V.P. is now 
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Q(x,y,z) 


Fig. 3. 


or | 
—=a \/?Т (їп V), 
ot 


oT; Р 
с =a, V? T, (in Vj), 


oT 
—k — = Q(x, y, 2) (on S), 
on 


T(x, J; 2, 0) = To; T(x, У, 2, 0) = То. 


T =a a t+ $x, y, z) + TG y, z, t), 
Т, = d, ut + ф(х, у, 2) + Түх, y, 2, t), 
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where T and T, satisfy (16) and (17), respectively. Consequently, 
V’ ф =a (in V), (21) 
V? ф = а (in И). (22) 


In order to make T and T, approach constants as ? — оо, we impose the 
B.C.’s: 


дф 
—k — = Q(x, y, 2) (on S), (23) 
on 
ad = а 0 (24) 
$ = $i 
" LÀ " p% | (on Sı). (25) 
on on 


In order for the latter Neumann type of problem to have a solution, the heat 
input per unit time via the flux Q must be exactly picked up by the distributed 
heat sinks corresponding to a and a, in the Poisson equations (21) and (22). 
Consequently, as in sec. 2, we must have by the divergence theorem that 


Jff sar- [Eass | [has 


or 
S k ò 
k dn 
Si 
Also, 
d$ 
ge ПСЕ 
Whence, 
OS=kaV+k,a,V, 
and 
OS a, 
kV -+ aki Ve 


It follows now that ¢ and ф, are determined uniquely up to the same arbitrary 
constant. As in sec. 2, we fix this constant by also requiring that 


[ [ [ear f | |с), = o. 
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Since the B.V.P. for Т and Т, is 


T z dT, 
—— = @\/?Т, — = оу? 1, 
[4 of 
k d S 
= um = (оп S), 
T= 1, 
oT of, | (оп S), 
k—=k,— 
on on 


[[ [Tav [ [ [nar.—o (at t= 0); 


T and T, — 0 as t — œ. 


5. The last problem to be considered is the same as the one in sec. 4 with the 
exception that boundary condition (18) is replaced by 


oT 
—k a Q(x, y, 2) sin wt + Q2(x, y, 2) cos ot. (187) 
n 


We now let 
T = $ (x, y, Z) sin wt + x (x, y, 2) cos wt + T (x, y, z, t), 
Т, = ф(х, » 2) sin wt + xi У, 2) cos wt + T(x, У, 2, t), 


where Т and T, are to also satisfy (16) and (17), respectively, and approach 
constant values as t — oo. This requires (as in sec. 3) that 


а V? $ ——oxy, а V?x —oó Gn V), (26) 

u V? hi =—0 x, m V^xi— o4, бай), (27) 

A = Q, 4 = 0, (оп S), (28) 
$= Ф, х= 

oe ze dhi | p2% = yt (on S,). (29) 


on on on on 
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By a similar argument to that used in sec. 3, we can establish uniqueness up 
to arbitrary constants for ф, y, $ y, since we can obtain 


(V 3 + (V 3) + (7 6)? + (V х)? = 0 


where ф = ф, — $5, etc. 

Explicit solutions for ф, y, ф, and y, can be obtained fairly simply in this 
manner for composite slabs, cylinders and spheres and will be given in a sub- 
sequent paper. 
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Abstract 


In this paper, we study the following game: We consider two players, 
whom we shall call the defense and the offense respectively. There are 
n properties T,, ..., Т, with values vy, ..., v, on which values both 
players agree. The offense has total resources normalized to 1 while 
the defense has total resources A; the interesting case is л > 1. The 
process of the game is that the defense first apportions its total resource 
among the properties Т; and then, with full knowledge of this assign- 
ment, the offense divides its unit resource among the same properties. 
If the defense assigns /, and the offense assigns a, at T;, the offense wins v, 
if and only if a, > hı. The problem is to spread the defense’s resource h 
in such a way that the total value of the properties taken by the offense 
under the offense’s best strategy is minimized, i.e., (41, ho, ..., An) 
must be such that 


max У Vi 
ау+аз+ ... +аһ=1 {i | a>} 


is minimal under the constraints Ay + й, +... + n Sh, each 
hi> 0. We shall prove that the problem is equivalent to a problem 
in the theory of linear equations. From this it will follow that for every n 
there is a finite set of defenses (Лү, A2, . . ., 4,) such that whatever A is, 
and whatever v1, v5, ..., v, are, an optimal strategy is in the set. We 
Shall also show how such a set may be constructed. 


1. Introduction 


In 1965-66 the authors studied an asymmetric game which had an interesting 
combinatorial structure; except for an internal Bell Laboratories memorandum 
the work never reached publication. However, it turns out to be closely related 
to Shapley's balanced sets of coalitions for a multi-person game ?) and thus to 
the notion of the core. We therefore present our earlier results, fortified by an 
important counterexample due to N. G. de Bruijn which has clarified the struc- 
ture considerably. We are particularly grateful also to L. S. Shapley *) for a 


ASYMMETRIC CONTEST FOR PROPERTIES OF ARBITRARY VALUE 41* 


number of references; we have not, however, attempted to connect our problem 
in detail with the flourishing theory of n-person games. 

We consider two players, whom we shall call defense and offense rather than 
first and second player in order to emphasize the asymmetry. There are n prop- 
erties Ту, ..., Т, with values v,, ..., v, on which values both players agree. 
The offense has total resources normalized to 1 while the defense has total 
resources Л; the interesting case is h > 1. 

The process of the game is that the defense first apportions its total resource A 
among the properties Т, and then, with full knowledge of this assignment, the 
offense divides its unit resource among the same properties. If the defense 
assigns h, and the offense assigns a, at Тү, the offense wins v, if and only if 
a, > h,. The problem is to spread the defense’s resource А in such а way that 
the total value of the properties taken by the offense under the offense's best 
strategy is minimized, i.e., (h,, h2, ..., hn) must be such that 


max У v 
aytagt+...+a,=1 dila] 


is minimal under the constraints A, -+ й, +...-+ №, <Å, each №, > 0. 
Remember that we assume the offense to be designed with full knowledge of 
the defense, that is, of (h4, . . ., 4,). We shall prove that the problem is equiv- 
alent to а problem in the theory of linear equations. From this it will follow 
that for every n there is a finite set of defenses (h4, hz, . . ., An) such that what- 
ever h is, and whatever vi, v2, ..., v, are, an optimal strategy is in the set. 
We shall also show how such a set may be constructed. 

In the present mathematical model, no Т, needs to be defended with strength 
> 1 to protect it from the offense. More generally, among all defenses for 
which the total property value which is lost is the same, it is sufficient to con- 
sider those with minimal А. If for a T, the defense is 0 then it is sure to be lost. 
We shall call a defense “essential for n" if 0 < h, < l for i = 1, 2,..., n. 
In general, a defense for n properties will have the form (1, 1, ..., 1, hi, ha, 
s Во 0, 0, . . ., 0) where (hi, hz, ..., hj) is essential for k properties, k > 0. 


2. The main theorem 


We consider a set T := (T4, ..., Ta} of n properties. Let (h,, ..., An) be 
any defense, 


h := p» hy. 
j=1 


With each nonempty subset S C T we shall associate an equation or an in- 
equality and we shall order these in a special way described below. For the 
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given defense (hı, ..., А,) and for any S C T there are three possibilities, 
namely 


Y 


УА 21. 


Tj;eS 


First we consider all subsets S for which 


У = 1 


Туе$ 


and with each we associate the equation - i 


Y xe. 


T,eS 


In this way we obtain a system of linear equations of which (/4,, .. ., An) ap- 
parently is a solution. Let / be the number of these equations. From this 
system of equations we pick a maximal collection of linearly independent 
equations. In the following we shall use k (k </) to denote the number of 
these equations, number the corresponding sets S C T as S,, S5, ..., S, and 
write the k equations as 


У &yx,= 1, G=1,...,%), (1) 
J=1 


where &у = 1 if T, e S, and ғу = 0 if T, ¢ S,. The remaining equations and 
the corresponding subsets of T are numbered with indices k + 1, k + 2, .. ., I. 
Hence each of the equations 


У ey xy = 1, G=k+1,...,/) (2) 
Ј=1 


is linearly dependent on the set of equations (1). Next we consider subsets 
Sc T for which 


yay > 1. 


TjeS 


With each of these we associate an inequality 


2m cel 
ier (3) 


Again (hy, ..., ^,) satisfies all the inequalities obtained in this way. The sub- 
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sets S now under consideration are then numbered $,,,, ..., Sj, and the 
inequalities are written as 


‘ 


Yd. ОЕ m) (4) 
j=1 А 


Finally we look at the subsets 5 С Т for which 


У hc 


TjeS 


Continuing the process started above we number these subsets $,,,,, . . ., Sr, 
where r — 2" — 1 and write the inequalities as 


У ey xy) <1, (1 = т +- 1,..., р). (5) 


Ј=1 


In the following, when describing а defense, we refer to (1) to (5) and use 
the symbols К, / and m as defined above. 


Definitions 

A defense (/, ..., Л„) is said to be saturated if, in (1), k = n; that is 
(hı, ..., А) is the solution of a set of n linearly independent equations of the 
form (1). 

A defense (h,’, ..., h,") is said to dominate (h,, ..., h,) if 


@ Veer] Zs219Xai] 
TjeS TyeS 
and 
(ii) LA < Dy 
j=1 j21 


both hold. If a defense is dominated only by itself we shall call it undominated. 
If, for two defenses (h,, ..., h,) and (h,', ..., Ay’), 


ЕРЕ) 


the two defenses аге called equivalent *). Notice that two equivalent defenses 
need not have the same total A. 


*) sgn0 = 0. 
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. Our first main result is the following. 


Theorem 1 

For every defense (hı, . . ., h,) there exists a saturated defense (h,', . . ., hy’) 
which dominates (h,, . . «Кү 
' Proof. Let us assume that there exists a defense (hy, ..., An) for which the 
conclusion is false, ie., there is no saturated defense which dominates 
(his ..., An). Then there exists such a defense with minimal n, n > 2. First 
we shall show that such a defense is essential for n. For if, say, ^, were 0 or 1, 
we could then consider the set Т\{Т„} with the defense (/, ..., A,-1). Since 
n was minimal there is a dominating saturated defense (h,', ..., An—1') for 
T\{T,}. If we adjoin to this defense the previously removed h, for property Tn, 
we have a dominating saturated defense on T. This last assertion is not trivial 
and the reader should take the time to convince himself, thus familiarizing 
himself with our way of using (1) to (5) to describe defenses. We may now 
assume that the defense (h4, . . ., An) is essential for n. If it is itself saturated, 
we are finished. If not, then, for this defense, k < n, and the system (1) deter- 
mines an (n — k)-dimensional subspace D of Euclidean n-space. If we impose 
the additional constraints (4) and (5), and the further conditions 


0<x,< 1, ї=1,2,...,п, (6) 


we define a convex subset H of D. H is not empty, since (h,,..., An) є Н; 
since the inequalities (4), (5) and (6) are strict, we know by continuity that H 
contains other points as well. In some nonempty portion of the boundary of H, 


For if the equation 


is linearly dependent on (1), then — 


Y х =й 


і=1 


everywhere іп H and on its boundary. If 


n 
у х= 
i-i 
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is linearly independent of (1), then it is possible to proceed from (hi, ..., А) 
towards the boundary of H 50 that 


IMs 
E 


decreases. At any such boundary point, at least one of the inequalities in (4), 
(5) and (6) must be an equality. If one of (4) or (5) became an equality, then 
this equality would have to be linearly independent of the set (1), for otherwise 
Ў £; x, in that inequality could not change in value. Thus, at such а boundary 
point, k would increase. At a boundary point at which (6) were to become an 
equality, we would have an equivalent defense which is not essential for n, and 
this case has already been covered in the previous paragraph. 

Thus in all cases, there is either a dominating saturated defense, or a 
dominating defense with larger k. This process can be continued until k = n, 
and the theorem is proved. [J 


We have thus proved that we can restrict ourselves to saturated defenses. 
But every saturated defense (/,, й,,..., h,) is the solution of n linearly 
independent equations 


У ey x, = 1, i=1,2,...,n, 
Ј=1 


where each e, is 0 ог 1. There are, moreover, obviously at most (2"-:) such 

systems, and hence only a finite number of saturated defenses for n properties. 
The exact number of such defenses as a function of n is unknown. The number 

for n = 1 to 5 respectively has been found to be 1, 2, 4, 9, and 26, where two 

defenses obtainable from each other by permutation are counted only once. 
Let 


n 
У ауху== 1, i=1,2,...,k, 
J=1 


be a set of equations of form (1). Let R;(j= 1, 2,..., п) be the subset of these 
equations in which г;; = 1 rather than 0. | 

The set of R; is a balanced set in the sense of Shapley, e.g. ref. 7, and the 
x, are their weights. The collection of saturated defenses corresponds in this 
way to the collection of minimal balanced sets 7). This correspondence was, to 
our knowledge, first utilized by Graver *). Complementation of balanced sets 
has its analog in a transformation of saturated defenses described in the next 
section. 
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3. (0—1) determinants 
Consider a matrix D = (d,,) where all dı; are 0 or 1. We shall use the sym- 
bol D to denote the determinant of this matrix. Let Ху, х2, ..., Xn be the 
solution of the system 
У a6 =D (== 1з п). (7) 


Ј=1 


In this paper we are interested only in those (0, 1)-matrices for which x,, 
..., хь are all > 0 but in the following two lemmas we do not use this. We 
define a complementation of a matrix as replacing all zeros by ones and all ones 
by zeros in a specified collection of columns. Let D (j,, ja, . . ., ji) denote the 
result of complementing the columns jı, ja, . . +» Jk and let D (ju ja; <- +s Ji) 
be the determinant of this matrix. 


Lemma 1. D (ja jo -< dd) = CD Ory, Hyg t c xj, — D). 
Proof. By Cramer's rule 
Furthermore 
D (js јој) + D Gn jas -- ng Лк—2, Jj) + 
+ Dis ja -- -odn—1) + Di; jas - + еа) = 0 
because it is ће determinant of a matrix with two columns consisting only of 
ones. The lemma now immediately follows by induction on k. 


Lemma 2. If D* = (dyjy*) = D (Qu jas - - Ji) then the solution of 


У djté;=D* (i=1,2,..., n) 


j-1 


ё = (—1)* x, if | FEC ts Jas + +, ЛӘ» 
== ey Xj if je Gi jz we o Ji. 
Proof. Apply Cramer’s rule and then lemma 1. П 


Consequence. We can now find saturated defenses by starting with (0, 1)- 
matrices with nonzero determinant, solving the corresponding set of equations 


n 
Yd,E-D ({=1,2,..., и), 
Ј=1 
and complementing the columns corresponding to the negative x, (or alterna- 
tively those corresponding to the positive x) in the solution. If (x 15 X255: X») 
is the solution of the equations corresponding to the matrix D* thus obtained, 
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= ( ) 


is a saturated defense. 
Remark. If (hy, Р, ..., hn) is a saturated defense and 


X_* 


D* 


a" 


D* 


РЫ 


р* 


> з... 


s=} h>l 


ї=1 


hy hz h, ) 
Seb 5—1, 5—1 


is also a saturated defense. It is found by complementing all columns of Р 
(cf. lemmas 1 and 2). This means that in constructing all defenses one could 
make the restriction that 


then 


iM: 
> 
V 
N 


(because either s or s/(s — 1) > 2). 
For numerical purposes this is not an efficient way to list all saturated 
defenses, given л. А more efficient way is described below. 


Definition. Consider the set of (0, 1)-matrices D = (d,,) for which the sys- 
tem (7) has a solution (x,, x2, ..., Xa) with all x, > 0. For each of these D 
we define D :— D/gcd (x, x2, ..., Xa). This is obviously an integer. Let М, 
be the maximum of D over the set. Then any saturated defense for n proper- 
ties consists of rationals with a common denominator < М,. 

As it is sufficient to list all essential defenses in compiling a list of defenses 
one can proceed as follows. Let m be any integer, n < m < N (we shall bound 
N in a moment) and partition m into n positive integers. If 


m=n+k=p,+pot+...+D, 
is one of these partitionings, and p, > p2 >... > р, then 


(p,/d, P2ld, эжеў Р.) 


with p, x: d < M, may Бе a saturated defense. This is the case if there are 
n linearly independent equations (1) for which (p,/d, p2/d, ..., p,/d) is the 
solution. This is often not so and therefore quite a lot of these potential de- 
fenses are excluded. 


Example. m=5=2+1+1+4+1, d= 3. Since (2, 4, 4, 4) is the solution of 
the 4 independent equations 


X, + X2 = XQ + X3 5 X1 + X4 = Xa + Xa ха = 1 
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we have found a saturated defense. For a total defensive strength > n/2 there 
is no essential defense except (4, 4, . . ., 4) so we can stop the above construc- 
tion at N = [5 n M,], where [x] denotes the greatest integer < x. 


4. The value of Mn 


The number M, defined in sec. 3 is interesting in itself. Little is known about 
the growth of M, as n increases. Clearly, by Hadamard's inequality, M, < n"?, 
but this is undoubtedly a very loose estimate. In the set of (0, 1)-matrices used 
to define M, there are elements with a very large determinant namely of the 
order of magnitude n"* 9/2 27" (cf. ref. 6, p. 107 and other estimates in 
refs 1, 2 and 3), but for these gcd (x,, x2, . . ., Xn) is also large and hence the 

, corresponding D small. For n = 2, 3, 4, 5, 6, we found M, = 1, 2, 3, 5,9, i.e., 
for n < 6 we have M, = 1 + [2-3]. We now show that this is a lower bound 
for all n. To do this we restrict ourselves to (0, 1)-matrices D = (d,,) for which 
the solution (х;, x2, ..., Xa) of (7) has the property that all x, are >0 and 
gcd (ху, х2, ..., X,) = 1. In this subset D = D. 


Theorem 2 
M, >1-+ 2°-3 for п >3. 


Proof. For n even (> 4) let D, = (4?) be a (0, 1)-matrix with determinant D, 
with the following properties: 

(а) D, = 2—3 11; 

(b) if (x4, x2, ..., Xn) is the solution of the system 


а, & = D, (21,...,n) 


1 


TMs 


then x, + x4 +... + X, 2 = 2773, х, = 2"7°, x, = 2"74 4 1 (note 
that gcd (x1, x2, ..., х) = 1); 
(c) D, has the form 
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Define D,,., as follows: 


Окы t= | 0 


0 


From the last column subtract the preceding two columns and then expand 
Dna Бу the first row. This gives 


n-2 
Dari = Y Cit Ht x, + D, = 277? +1 


і=1 


and the solution of 
n+1 


È di, for 1) £j = 21-2 +1 


Ј=1 


(2X1, 2x2, . .., 24.2, 2х„..,—1, 2x, — 1, 1), 


which is easily checked by substitution. At the end of this proof we will have 
shown that a D, with properties (a), (b), (c) exists for all even n >4. The 
preceding step of the proof then implies that M, > 1 + 27? for odd n > 5. 
Next define D,,;' as follows: 


P.I 
Di? e 


= © O н 


Dia; 


me s.’ 


Expanding by the first row we have 
Р, = Dns + Das 1* 
where D,,., and D,,,* have complementary last columns, i.e., by lemma 1, 


Darz’ = —(2"-? + 1) + (1 —(2"-2 + 1) = —(2"-1 + 1). 
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The solution of 
n42 


Y dyt? £j = 20-1 +1 


j=1 


(2x1, 2x2, sees 2х,-2, 2Xn—1 T 1, 2x, tT 1, 295* + 1, 2A) 


which is again easily checked by substitution. If we define D,,,.. by interchanging 
‘the last two columns of D,,;' then О, satisfies conditions (a), (b) and (c). 
Theorem 2 is now proved by induction if we give an example for n — 4 and 
п= 3. For n = 4 the example is 


For n — 3 we have 
101 
110|= 2. LI 
011 


Remark. The inequality in theorem 2 cannot be replaced by equality. This is 
shown by the following example: 


D, = 


` The set of equations associated with the preceding D; is 


X, + X2 + X3 = 18, 


xı + x2 + ха + x7 = 18, 
x, + х2 + Xs + х6 = 18, 
x, + хз + ха + Xe = 18, 
xi + xs + Xs + х6 + хт = 18, 

X2 + Xs + ха + х5 = 18, 


х + хз + ха + х6 + ху = 18. 
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The solution is (x4, ...,; x7) = (7, 6, 5, 4, 3, 2, 1). The balanced-set form of 
this example is due to Jacqueline Shalhevet in 1968. О» = 38 can now be 
constructed by an ingenious idea due to Peleg 5). We observe each equation 
contains either x2, хс, or both. Let us now add xg = / to each equation which 
contains only one of x; and xg, and let x, and xg also be augmented by /. 
Then 18 is increased by 2/; an additional equation x; + xg + xs = 18 + 21 
implies 8 + 3/ = 18 + 2/, 1 = 10, so that 18 is increased to 18 + 2/ = 38. 
Shapley ?) has also obtained the relation 


1 
Dyi > 2 D, (1 --) 
n 


5. A complete analysis for п = 5 
We begin with table I. 


TABLE I 


Essential saturated defenses for n < 5 


partition defenses 


1+1+1=3 
1+1+1+1=4 

2+1+1+1=5 

1+14+14+14+1=5 
2+1+1+1+1=6 
34+1+4+1+4+14+1=7 
2+2+1+1+1=7 
34+2+1+1+1=8 
3+2+2+1+1=9 


($, 2, 4) 
(0,3, 3), 6,4, +, 9 

@ 343 

G; 4, 4 11. G: +, 3 4, 1), 44449 
@ 31), @,Ь% 1} 

CEEE) 

G, $, $, $, 4), G, $, 4, 4, 4), b $, +, +, 4) 
@ % $44 

(4, $, 3, 4, 2), b ? ? t 1) 


We may now study the case of 5 properties in detail, and obtain a complete 
list of possible defenses and the range of h for which each defense must be con- 
sidered. Assume v, > v; > v4 > v4 > vs, and that the offense always takes 
the highest value it can obtain. A list of all defenses for n = 5 may be derived 
from all essential defenses for п < 5 by adding 1’s and 0’s. Each defense needs 
to be considered from the lowest Л for which it is possible up to the lowest A 
for a better defense, that is, one that is guaranteed to protect more property 
value. For instance, (4, +, 4, 4, 0) has A = 2. Therefore a defense that protects 
T, to T, equally in such a way that the offense can get only T,, and has not 
defended Т» at all, is possible for / > 2, and guarantees that the offense gets 
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no more than v, + v. For 2 << h < 2:5 there is no better way to defend the 
properties. At h = 2:5 the defenses (1, 2, 3, 2, 0) and (4, 4, 4, 4,4) become 
possible, each of which guarantees the defense better success (loss of only 
V, + v; and v,, respectively). Notice that in the preceding examples, the order 
of the v, uniquely determines which T, the offense will attack. For some 
saturated defenses, this is not the case. For instance with h = 1:4, (2, 2, +, +, 1) 
allows the offense his choice of v, + v; or v, + оз + v4, and either total value 
may be higher. Only saturated defenses of the form 


where some of the k, may vanish, are "pure" in the sense that they lead to 
unique determination of the T, which the offense will attack. Figure 1 (see the 


Fig. 1. Fraction of total property value saved by the defense. Drawn curve: all saturated 
defenses; dashed curve: only pure defenses. Property values: (17, 12, 8. 7, 6). 


end of this section for details) shows, for a particular set of v, the difference 
between considering all strategies and only “pure” strategies (for which the 
theory is easy). 

From table I we may now compile the corresponding complete list of un- 
dominated strategies for n — 5, together with the total ^ required, and the 
properties which will be taken. Ап entry such as “12 or 134" means that the 
defense, although undominated, is not pure: the offense will obtain either 
V, + v4 Or v, + v3 + v, at its discretion. 


defense 


(1, 0, 0, 0, 0) 
(Ф, 4, 4 4, 2) 
(3, 4, 3, 4, 0) 
(62,2, з) 
(5, 4, 4, 0, 0) 
G 4, 4 4; 4) 
@ 5h53 
G: 4, 4, 3, 0) 
4,3,4,4, 3) 
G, 4, 4, 4, 4) 
G, 4. 2» 5 $ 4) 
(8, $ 5? $ t 4) 
(1, 1, 0, 0, 0) 
G: 4, 4, 2, 0) 
G: 53,53 
Ф, 4, 4, 4 Ф) 
(1,4, $, $ 3) 
G: 4, 3, 5, 3) 
(1, 4, 4, 5, 0) 
@ 115) 
1,3, 3,4, 3) 
УК 1, 1, 0, | 


necessary й = 
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TABLE II 


Saturated defenses for n = 5 


properties lost 
(v, 2 v2 2 оз È V4 2 Vs) 


2345 
123 
125 

12 or 134 
145 
12 or 234 
13 or 234 
15 or 235 
12 
1 or 234 
13 or 345 
14 or 23 or 245 
345 
15 
1 or 23 
1 or 24 
23 
1 or 34 
25 


none 


Using table II we may now make a plot of all the saturated defenses which 


must be considered for any given value of Л. For instance, (4, 4, 4, 


4, 0), which 


requires h > 1-3 and gives the offense 125, is poorer than (4, 4, 4, 0, 0), which 
requires Л >> 1:5 and gives the offense 145. However, (4, 4, 4,4, 0) is not 
poorer than any strategy between these two in table II, and hence must be 


considered for 1:3 < А < 1:5. 


The plot allows us to find all defenses that must be considered for a given 
range of Л. For instance, if 1-4 << h < 1:5, then the defense has the choice of 
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giving the offense either 2345, or 125, or the offense's choice of 12 or 134. 
The corresponding strategies, namely (1, 0, 0, 0, 0), ог (4, 4,4, 5,0), or 
(2, 2, 4, 4, 1), are found in table П. 
14 or 23 
or 245 
' 1 
(20r 343, 
Lu ог 236 oe 
'_12 23 ^u 
12ог 12or 130. en, 1 ог tor dori 
134 234, 234115 or 235 ' 23 2h 1341 


123 UE | 145 (5 


125 f 
— —» Defensive force h 


Fig. 2. Properties lost by saturated defenses; п == 5. 


Using the plot of fig. 2 the effectiveness of various total defensive strengths 
was studied for a particular set of 5 properties. In the chosen example, the v, 
are given by (17, 12, 8, 7, 6), so that the total value of all properties is 50. 
Figure 1 shows the proportion of total value saved by the best appropriate 
defense as a function of total defensive strength Л. The dashed curve gives the 
best that can be done by pure defenses alone, and shows the value of the more 
complicated defenses. 


6. Undominated defenses 


In the case of 5 properties, each of the saturated defenses is actually the 
unique best defense for some set of values (v,, ..., v,) and some A, in the 
sense that no other defense could equal its effectiveness. It is an immediate 
consequence of theorem 1 that a defense which has this property, i.e. it is un- 
dominated, is a saturated defense. On the other hand a saturated defense is not 
necessarily an undominated defense as is shown by the following examples, 
which are alternate versions of an example due to N. G. de Bruijn: 

(а) (4, 2, $, 4, 4, 4) and G, 4, 3, 4, 4, 3) are both saturated (both with total 
strength h = 3). The first one is dominated by the second one. 

(b) (2, 4, $, 3, 4, 4, 4) with Л = Y is saturated but dominated by (5, 4, 4, 4, 
4,4,4) with h =4. 

We remark that in any saturated defense (k4, h2, ..., An), any element hy 
can be repeated an arbitrary number of times (say s) thus leading to a saturated 
defense on n + 5 — 1 properties. If /, is not of the form m=! (me №) or 0, 
then this saturated defense will in fact be dominated if s is sufficiently large. 
This is our understanding of De Bruijn’s example. 
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7.. Concluding remarks 


For the benefit of any readers who might develop a further interest in the 
problem, it is perhaps worth recording two other conjectures suggested by our 
work that turn out to be false: 

(a) Any saturated defense contains repeated values of h; 
Counterexample: (ў, $ 5, ip 5, 2, д), or the example for D; at the 
end of sec. 4. 

(b) Any saturated defense written with lowest common denominator D must 
contain the element 1/D. 
Counterexample: (5, 5, 5, ть тр 5, & 2, 3). 

At present the authors are studying the similar problem for the situation that 
the offense does not have knowledge of the assignment of the defense. The dif- 
ference in values of the two games will give some indication of the value of 
“inside information”. For example, it is not hard to show that if n = 2 and 
3 < h < 2 the expected value of properties taken by the offense is 


V; V; (v + ¥2)74, 


whereas in the game we have treated the offense gets v;. Thus the value of 
the offense's advance knowledge of the defense is v5? (v, + v;)^ !. 

Finally, we note that, mutatis mutandis, our results also apply to the fol- 
‘lowing game with defense-last-move: Defense has total strength h = 1, offense 
total strength b >1, defense divides forces after seeing how offense has 
chosen to divide forces, properties lost if b; > /;. In this game, the saturated 
offenses correspond to the saturated defenses we have studied, and sets of 
properties lost are the complements of the sets we have found. 
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1. Introduction 


This paper will be concerned with radar scattering from the sea when the 
wavelength is large compared to the amplitude of the sea surface disturbance 
and the antenna is located on the surface. It has been observed, both theoreti- 
cally and experimentally, that under these conditions, for vertical polarization 
of the radiated field, the return is due primarily to a single spatial frequency 
component of the sea surface and is much larger than would be expected from 
an estimate based upon summing the returns which might be attributed to the 
individual water wave peaks +). 

This phenomenon has been explained as a type of Bragg resonant scattering 
which occurs in the backward direction when the sea surface wavelength is half 
that of the radar ?). It can be seen, in fact, from an examination of the results 
of Rice °), that when this resonance condition is satisfied an incident plane 
wave excites an electromagnetic surface wave of very large amplitude propagat- 
ing in the backward direction. 

Barrick and Peake, by integrating the effects produced by Rice's model of a 
slightly perturbed surface, have calculated tbe average scattering cross-section 
per unit area со for the sea ^). Barrick ?) has shown some agreement with 
experiment for his result using the statistical model of the sea surface given 
by Phillips 5). However, there appears to be some reason for concern about the 
validity of characterizing HF radar return from the sea by means of a scattering 
cross-section since the distance between the illuminated area on the sea surface 
and the antenna will usually be too small for the validity of the radar range 
equation; i.e., while the scattering region may be in the far field of the antenna 
the antenna will often not be in the far field of the illuminated sea surface area 
at these wavelengths. A calculation of the radar return valid for shorter distances 
would, therefore, appear to be useful. 

In the following a variation of a method due to Wait *) is used to calculate 
the return from an abritrary small antenna located at the sea surface at distances 
from the scattering region just large enough so that the scattering region lies 
in the far field of the antenna. Specifically, it is assumed that the distance 


NEAR AND FAR FIELD НЕ RADAR GROUND WAVE RETURN FROM THE SEA 57* 


between the scattering region and the antenna is large compared to the radar 
wavelength which is, itself, large compared to the sea surface disturbance ampli- 
tude, but not necessarily large compared to the width of the illuminated sea 
surface area. The formula for the radar return derived in this calculation shows 
that Barrick's result is essentially correct, modified slightly by the antenna pat- 
tern and the directional behaviour of the spatial power spectrum of the sea 
surface, for very large distances compared to the width of the illuminated sea 
surface. However, when the width of the illuminated surface is large Barrick's 
result can be greatly in error. 


2. Reflection coefficient 


It will be assumed that the amplitude ¢ of any disturbance of the sea surface 
relative to a completely calm sea, which is idealized here as an infinite perfectly 
conducting plane *), will be small compared with the radar wavelength A. It 
will also be assumed that the surface gradient at any point in the disturbance 
is small compared to one. Thus, if the undisturbed sea is the plane z — 0, in a 
cartesian coordinate system oriented so that distance above the sea is measured 
in the z direction, and 


z = C(x, y) (1) 
represents the disturbance, the assumed conditions are 
<A, 
| (2) 
|vz| « 1. 


The antenna is assumed to be small enough so that the current can be regarded 
as constant over the gap and no important contributions to the source exist 
outside the gap. It is also assumed that the antenna is located in the plane 
2 = 0 and is oriented so that the electric field is vertically polarized. 

An approximation to the reflection coefficient due to a sea surface disturbance 
satisfying conditions (2) in the presence of a small antenna can be obtained by 
means of a technique of Wait *). For the present purpose the necessary expres- 
sion is most easily derived from a form of the so-called compensation theorem 
which follows immediately from the identity 


У .(ЕхН) —V.(ExH)—- E .J—E.J', (3) 


where the field (E,H) is due to a current density J, and the field (E’,H’) is due 
to a current density J', and the fields exist in otherwise identical media although 
they do not necessarily satisfy the same boundary conditions. If J is set equal 


*) According to Wait !) and others, under the conditions assumed here the sea can be 
regarded as a perfect conductor with small error. А first-order correction can be obtained 
by multiplying the radar return by a Norton ground wave attenuation factor. 
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to J’ and (3) is integrated over all space above the ground plane z = 0, the 
result will be i 

Z-z) = | [ Ex —Е'хн), ах dy, (4) 
where Z апа Z’ are self impedances at the current source and J is the total 
antenna current. 

If the primed fields are those produced in the presence of a perfectly con- 
ducting disturbed ground plane surface given by (1) and n is a vector normal 
to that surface, then 

nxE' = 0, (5) 


on the surface. From (5) it follows that 


og 
Си + Е’, = 0, 


y 
og 
— Е, + E'X = У, (6) 
ox 
oc oc 
—Е', == — E', 
ox oy 


The third equation in (6) is not independent since it is actually a consequence 
of the first two. 

If the unprimed fields are those which occur in the presence of an undisturbed 
perfectly conducting ground plane, then because of (2) 


oc 
E', x — — Е, (7) 
oy 
and 
әб 
Е'„ аш — — Е, 
ox 


to the first order in the perturbation. 
For a sufficiently large distance from the antenna on the undisturbed ground 
plane | 


E’, © —]о Hg, (8) 


where 7p is the impedance of free space and H, is the azimuthal component in 
polar coordinates of the magnetic field at the surface z = 0. In the far field 
the magnetic field has no other component since it is orthogonal to the electric 
field and to the radial direction, from which it follows that to first order in the 
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perturbation the vector Ex H' in (4) has no z component and hence does not 
contribute to the integral. 
Then from (7) and (8) applied to (4) 


, No 2, 
2—7 a ds, (9) 


which provides an approximate expression for the change in self impedance of 
the antenna due to a disturbance in the ground plane surface. The variables r 
and o are coordinates in a polar coordinate system centred at the antenna, and 
I'is the illuminated disturbed part of the ground plane and is assumed to be 
finite and far from the antenna. In practice this isolation of Г can be accom- 
plished by using a finite pulse and range gating the return, which may then be 
regarded as coming from a well defined range cell, i.e., the region Г. 
The far field, e.g., at Г, has the form 


-ikr етік 


Нот Нр и) =, 
т (10) 
e^ r 


E. ~ No h(¢) c 


If the antenna is matched to the undisturbed ground plane the impedance Z 
seen at the antenna in the absence of a surface disturbance consists entirely of 
its radiation resistance which is given by 


Z= E | | Hg, 0) ао, (11) 


where Q is the surface of a unit sphere. The relation (11) is just a statement 
of the fact that the total radiated power is equal to Z 12, which is equivalent 
to a definition of Z in this case. From (11) and the definition of antenna gain G, 
then, 

4z No Ho? 


, 12 
rm (12) 


where 
Ho = H0, Зл) = A(0). 


The reflection coefficient R due to the disturbed ground plane when the | 
antenna is matched to the undisturbed ground plane is given by 


Z—Z Z—Z2Z 
- we, 13). 


R= 
2+2 22 
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where the fact that for a small amplitude disturbance Z’ ~ Z has been used. 
Then from (9), (12) and (13) it follows that 


G d 
R w—-— | 8 ыз ds. (14) 


3. Average reflected power 


It will be assumed that the sea surface disturbance is a stationary random 
process so that the autocorrelation function 


KG; r,r) = €@ Cr) 
is a function of the form 


KE; r,r’) = C; r — r’), (15) 


i.e., a function of the difference r — г’. The power spectrum of ё is the Fourier 
transform of C(£; r) and will be designated by S(5; x) which is a function of 
the wave number vector x. The power spectra of the derivatives £, and £, are 
x? S(Ċ; x) and x,? S(C; x), respectively. For convenience, when there is no 
danger of confusion ¢ will be left out of the arguments of S and C. 

Since 


oc | 
— = (,cos ф + 6, sin 0, 
or 
it follows that 
K(é,3 r, Y) = C(x; r —т') cos pcos ф' + C(b5;r—x)singsing', (16) 
which has the form 
2 
Kéne r) = У Сбт) AOAC), (7) 
j=1 
where 


Л@ =Л(@ = cosp, Л) = (9) = sinp, Gi) = C(b;r), 
and 
G(r) = C(5; r). 


The average value ЕЕ of the reflected power, according to (14), is 


Lire | | | GE) C )H,() Hy*2(r') dS 45. (18) 
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Because of (17) this has the form 


where 
Hy) = fi) H^. 
If the Fourier transform of G(r) is designated by 5 (х), then, with reference 
to (17), it can be seen that 
S1(x) = x,? S(x) = x? cos? o, S(x) 
and | (20) 
S2(x) = x? S(xj) = x? sin? p, S(x). 


From the definition of the power spectrum it follows that (19) can be written 


Шс ПЕС J T | Гері. гну G6) asas аи 


2 
G? y 
= x HS | | S) |1009]? d?x, (21) 
jai 7-9 f . 


where 


L) = | | exp ({х.т) Нут) dS 


Р exp (—2ikr) 
=| ——— j; SAPP expli Ge cos р + sin Ф) арал. (22) 


Dy 
In (22) D, and D, are the radial limits of the range cell defining Г. 

Since r in (22) varies over distances which are all large compared to the radar 
wavelength, the angular integral can be evaluated with good approximation by 
the method of stationary phase except for values of x <k. It is easily seen 
that the one stationary point фо which occurs within the integration region is 
given by 

Фо = arctan (х,/х„) = Pu 
Then, 
2л \ 4/2 "exp [i (x — 2k) r] 
I(x) & (=) exp (—i 2/4) f(x) h (px) [ INT aaa dr. (23) 


Di 
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For 
D= OR -+ D2), 


the mean antenna distance, and 
L = р, —D,, 


the width of the range cell, (23) can be written 
1/2 
dx) = (=) Si Px) (p) exp {i [(% — 2k) D—2/4]) Fl(«—2k) L], (24) 


where 
exp (i u r/2) 


F(u) = | ————— dr 
a (r + 2D/L)*/? 


If L becomes large while D/L remains fixed, i.e., L and D grow at the same 
rate, then 


L|FI(« — 2k) пі дч J |F(u)|? du à(« — 2k), (25) 


i.e., for an arbitrary function w(x) 


L f w(x) | Fle — 2k) 11]? ах ~ w(2k) Í |FG[? du. 


-—00 -00 


By Plancherel’s theorem for the Fourier transform 


| |FG)|? du = 4л / ао 


-%0 


l6z D 
7 L(AD?]I? — 1 
Then, by combining (21), (24), (25) and (26), it can be seen that 


(26) 


22 


[RE Р (ost gu + sint p) HC) SRK, p) d от) 
х A ————— cos х Sin x, m » Dx xs 
с гарз — 1, Ф Ф Ф Фк) аф 
where 
D 
р = 
L 
and 
h 
H=—, 
Ho 
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which is the antenna – ground plane pattern normalized to unit amplitude in 
the forward direction. — 

The expression for |R|? given by (27) can be converted into one for the cross- 
section с by multiplying by the factor 16 л D^ k?/G? in accordance with the 
standard radar equation. Thus, 


64 2 D? k* рэ 7" - 4 
da aD ] (cost px + sin* px) H’*(px) SQk, gx) аф. (28) 
о 


The area A, illuminated by the radar beam if the beam width is f is given by 
Ag = B L D. 
Thus the cross-section per unit area со is given by 


2n 


б4р". [ (cost pu + sin* p) Нер.) SOK, р) ар. 09) 
= ———————— cos* m sin x x, и, х. 
арз —iy B, 9 9 9 px) dp 
If 
D'> 1, 
then 
As k* Ai 
Oo © (cost px + sint px) H’*(px) S(2k, px) dps. (30) 


0 
This result does not differ substantially from that given by Barrick, namely, 
Со & 4л kt S(2k, 0). 


The fractional error є created by using (30) instead of (29) is given by 


8D? —1 
С (@р®—1)° ` 


This quantity will be small only if р’ > 1. 


4. Conclusions 


The standard calculation of c, given by Barrick for HF radar scattering from 
the sea by a small antenna on the sea surface is valid if the scattering region is 
far enough from the antenna. However, if the distance between the scattering 
region and the antenna is not large compared to the width of the illuminated 
area, that calculation can be greatly in error. Since the correct value of со, in 
this case, is distance dependent the reflected signal power, given by (27), should 
be used instead of со and the radar range equation. 
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Dear Chris, 


I have always admired your critical no-nonsense attitude, in physics (and 
elsewhere), so refreshing in today's atmosphere of vague philosophizing. 
Although it would be misleading to claim that physics is an exact science, one 
has a right to hope that it is exact enough to be treated by logic and common 
sense. On the whole this is so, but some problems seem to touch a mystical 
chord even in physicists who are otherwise quite reasonable. One of those 
problems is the proper understanding of the role of the measurement process 
in quantum mechanics. I am insufficiently familiar with the extensive literature 
on this subject to write a formal article on it, but I am glad with this opportunity 
to tell you about a small calculation I have done. It may amuse you, as it brings 
some of the lofty discussions down to earth. 

Allow me a few not very original preliminary remarks to clear the way. First, 
quantum mechanics, as all physical theories, purports to describe physical 
phenomena. Physical phenomena are observed and registered by measuring 
apparatus such as dials, photographic plates, or on-line computers. The result 
is fixed in a fully objective way for everybody to see, e.g., by the blackening of 
the plate, or the figures printed in the Physical Review. Such are the physical 
observations that led to quantum mechanics and are successfully explained by 
it. However, in quantum mechanics the mathematical formalism is less directly 
connected with the observations than in classical theory. The aim of the theory 
of measurement is to elucidate this connection. 

The second preliminary remark is that quantum mechanics is used successfully 
every day, even though the controversies about the measurement process are still 
raging. (This situation is aptly characterized by Groenewold's saying: “Only naive 
people think that a science is based on its foundations”.) The physicist sets up an 
apparatus, throws the switch, and registers the result. Quantum mechanics 
provides a well-defined scheme for computing these results, accepted by all 


*) Most of the work was done during a temporary stay at the University of Texas at 
Austin, Texas, U.S.A. 
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parties concerned. It appears therefore that the problem of measurement is 
implicitly solved by the working men. All that is left to the philosopher is to 
analyze what goes on and to make the process more explicit. 

Why should this give rise to so many difficulties and controversies? The 
quantummechanical formalism employs wave functions as useful (though not 
indispensable) mathematical tools. The difficulties enter if one considers the 
wave function (or state vector) as more than a mathematical tool and attempts 
to attach a more concrete physical meaning to it. Admittedly the history of 
physics knows many examples of quantities that were originally introduced as 
rather abstract mathematical concepts and gradually became more concrete. 
Energy was at first just an integral of the equations of motion and later became 
so concrete as to be endowed with mass. However, there is no a priori or philo- 
sophical reason why a mathematical tool should be amenable to a more con- 
crete interpretation. It may or may not turn out to be possible and fruitful to 
bestow more properties on it than contained in its original definition. It is fair 
to say that in the case of the wave function such efforts appear to lead to incon- . 
sistencies and paradoxes. There is no logical objection against abandoning them 
and yet preserving the factual results of quantum mechanics. 

Somehow that seems to be unsatisfactory to a number of people, even though 
they readily admit that the phase factor of the wave function has no physical 
meaning, and though they never had any qualms about the vector potential in 
classical electromagnetic theory. Many attempts have been made to attach a 
more physical meaning to the wave function. My view is that anyone is free to 
invent physical interpretations of the wave function, but the burden of proof 
that they.are consistent and useful is on him. When his efforts lead to such 
flights of fancy as the many-world interpretation I beg to be excused. 

A third remark has to be made to stop up another source of confusion. When 
quantum theory is used to compute the outcome of a certain experiment or 
measurement, the result is often stated in terms of probabilities for the various 
possible outcomes to occur. Such a statement cannot be verified or falsified by 
a single observation — unless the statement happens to be that there is an over- 
whelming probability for one particular outcome (or one particular subset of 
all possible outcomes). Otherwise one has to repeat the observation many times, 
as is done automatically in a scattering experiment. The physical phenomenon 
one is interested in, and about which quantum mechanics can make a pre- 
diction, consists in the frequencies with which the various outcomes occur in 
this sequence of observations. The diffraction pattern observed by Davisson 
and Germer is an example of such a physical phenomenon. The word “prob- 
ability” merely serves to reduce the computation to a single particle scattering 
problem, but the objective, physically verifiable phenomenon consists of the 
diffraction pattern produced by the whole beam. The task of quantum mechanics 
is to describe non-stochastic, repeatable physical observations. The concept of 
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probability merely enters as a convenient tool at an intermediate stage of the 
calculation, just as the wave function. 

These remarks may be summarized. Quantum mechanics makes sure pre- 
dictions about macroscopic, objectively verifiable phenomena. Statements about 
the probable behavior of single particles are unverifiable, unless one repeats the 
observation so many times that the experiment is actually macroscopic. Con- 
siderations about the interference of the observer with the measurement have 
served their purpose of clarifying certain aspects of the quantummechanical 
formalism, but they involve an extrapolation of the role of observers, who in 
reality only deal with macroscopically registered phenomena. 

This point of view is sometimes slightly deprecatingly referred to as “Copen- 
hagen school" or as "positivism", but it appears to me simply as a common- 
sense description of the way in which quantum mechanics is actually employed 
in physics. Any extension of the role of the wave function or the probability 
concept beyond their use for predicting objectively verifiable phenomena be- 
longs to the realm of metaphysics. In particular, speculations about the wave 
function interacting with the consciousness of the observer are beside the point, 
because they do not clarify the connection between the mathematical formalism 
and the physical phenomena for which it has been invented. 


According to standard theory a system that is left alone (no interaction with 
external agencies) is described by a wave function 


y(t) = е8 (0). (1) 


A measurement of а physical quantity С corresponds to an operator С with 
eigenvectors x, and eigenvalues А, (which I take discrete and non-degenerate for 
convenience). If G is measured at time /, the outcome is one of the values 4,, 
each with probability 


ln | 02). 0) 


After the measurement the state of the system is no longer described Бу (1). 
Suppose the outcome of the measurement was observed to be Am. Then one 
knows after the experiment that the state of the system is described by the 
corresponding eigenfunction 7,. Thus for £ > t, 


y(t) = EED у, (3) 


This abrupt change during a measurement has been called “the collapse of 
the wave function”. 

Incidentally, if somebody has performed the measurement but did not look 
at the outcome, all he can say is that the wave function must be one of the Xns 
each with probability (2). This is entirely analogous to classical probability 
theory: before you throw a coin you know that the outcome is heads or tails 
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with probabilities 4, 4; after you have observed the result the probability dis- 
tribution is 1, 0; but if you do not observe the result the probability distribu- 
tion is still 4, 4. In quantum mechanics one calls this a “mixed state”, which 
may be formally described by means of a density matrix. It should be clear, 
however, that such a mixed state is no more an actual state of the system than 
that a coin can be in a mixed state of heads and tails. 

Returning to the collapse of the wave function (3), one may ask the following 
question. Suppose I regard the object system together with the measuring 
apparatus as one combined or total system. Suppose I could solve the 
Schrödinger equation for this total system. Then I can find exactly how the 
object system is affected by the interaction with the measuring apparatus. Does 
this lead, indeed, to the collapse of its wave function? My aim is to investigate 
this question with the aid of a simple model. 


As our object system we take a single electron, and as the measured quantity 
its position. This case has been discussed by Heisenberg in terms of his well- 
known gamma-ray microscope, in which a single photon was supposed to inter- 
act with the electron. His and similar discussions have been useful for clarifying 
the role and consistency of the indeterminacy principle, but they do not describe 
the actually measuring process. In fact, they lead to a paradox when the question 
is asked how the measuring photon is observed; that would require a second 
measuring apparatus, and so on, and the chain of measurements does not end 
until it has reached the brain of the observer. In line with our preceding dis- 
cussion, we emphasize that an actual measurement should lead to a macroscopic, 
objectively registered result (as stated repeatedly by Bohr). 

Hence the measuring apparatus must be a macroscopic system. How is it pos- 
sible for a single electron to affect the state of such a big system in a macroscopi- 
callyvisibleway ? This can be achieved if the apparatus is initially in a metastable 
state, whose decay is triggered by the electron. Actual devices, such as Geiger 
counters, bubble chambers and photographic plates are, indeed, prepared in a 
metastable state. This also explains how a result can be objectively registered: 
once the system has made a transition from a metastable into a stable state it 
is no longer disturbed by observing it. 

In the preceding paragraph the word “state” means the macroscopic, thermo- 
dynamic state of a many-body system, which consists of an immense number of 
microscopic quantummechanical states. I shall use the words “macrostate” and 
“microstate” to emphasize this distinction. The classical nature of the apparatus 
appears in the fact that we can never say with certainty that it is in one of its 
microstates. (I have copied this sentence from Landau and Lifshitz, adding only 
the word never!) Note that the irreversible behavior of many-body systems is 
essential for making objectively registered measurements. 

In order to construct a soluble model of a macroscopic system containing a 
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metastable state we shall have to reduce it to its bare essentials. Rather than the 
AgBr crystals of the photographic plate I take a single atom having a ground 
state with zero energy and an excited state with energy О. It interacts with the 
radiation field, but I suppose that the matrix element of the dipole moment 
between the two states vanishes. The measuring apparatus consists of this atom 
plus the photon field; the macroscopic character is simulated by the many 
degrees of freedom of the field. As initial state of the apparatus take the atom 
in its excited state and no photons present — which is clearly metastable. The 
irreversibility is brought about by the fact that once a photon has been emitted 
it cannot be reabsorbed. 

To construct the Hilbert space of the measuring apparatus one has to take 
all microstates in which the atom is either in the ground state or excited, com- 
bined with all possible photon states. For our purpose, however, it is sufficient 
to take the microstate |2; 0) with excited atom and no photons, and the 
microstates |0 ; КУ with atom in the ground state and one photon К. АП these 
microstates are mutually orthogonal and span a subspace of vectors 


V = p |2; 0) + У y |0; k), (4) 
k 


with coefficients Ф, y,. This subspace is now combined with the electron 
Hilbert space by taking for o, y, functions of the position r of the electron. 
(Polarization of photons and electrons are ignored for simplicity.) The Hamil- 
tonian for the total system is in obvious notation 


Q 
&-( о) t Ekara y, (5) 
00 k 

In order to introduce an interaction between the object system and the 
measuring apparatus suppose that the presence of the electron perturbs the 
atom so as to create a dipole moment with a matrix element u(r) depending 
on the position of the electron. The precise function и is not important, provided 
it vanishes outside a certain neighborhood U of the atom. The extent of U 
determines the precision of our position measurement, and the magnitude of u 
is the efficiency of the apparatus. For convenience suppose u real and take the 
position of the atom as origin of r: 


È i Vr (ак — а). (6) 


k 


H, --P.20) =~( : p 


u(r) 0 


' The coefficient v, contains the normalizing factor (4 k/V)'? of the field 
modes, V being some volume in which the field is enclosed. In addition v, 
contains a damping factor to exclude wavelengths shorter than the diameter 
of the atom. The Schródinger equation for the wave functions (4) of the com- 
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bined system is now readily found to be 
i ġlr, t) = (0—3 V?) oft, t) — iu(r) У vi (т, t), (7a) 
k 


ip,(r, t) = (k — 3 V?) pl, t) + iv, u(x) gk, t). (7b) 


We are interested in that solution that for t — —оо takes the form: y, = 0 
and g is some incident wave packet 


gr, t) = f AQ) exp [i p.r — i ($ p? + ©) t] dp. (8) 
The final state for t > со will have components ф(т, t) and »,(r, г) such that 


lor, D| = probability density for finding the electron at r without having 
triggered the measuring device; 

lyr, t)|2 = probability density for finding the electron at r, having triggered 
the atom into emitting a photon k. 


If one sees a photon, or if one sees that the atom is in its ground state, one 
knows that the electron has been in the region U. Such knowledge can be used, 
for instance, to determine whether or not the electron passed through a given 
hole in a screen, and may therefore be regarded as a position measurement. The 
final stable macrostate of the apparatus consists of an immense number of 
eigenstates labelled by k, between which the observer cannot and need not dis- 
tinguish. 

There is no need to solve the Schrödinger equation (7) explicitly to see the 
wave function collapse. The point is that if the electron is not observed, i.e., if 
the atom is not de-excited, the state of the electron is described by g(r, t). 
This is the original wave packet (8), slightly modified by elastic interaction with 
the atom. However, once the electron has betrayed its presence in U by de- 
exciting the atom, it is henceforth described by the components y,(r, t) of 
the total wave function (4). 

The general form of these functions can be gleaned from (7b). Set temporarily 
p,(r, t) = е y(r, t), so that y, obeys 


PX. +$ V? gy = ie" v, u(r) фт, t). (9) 


The left-hand side has the form of the Schrédinger equation for the free elec- 
tron, and the right-hand side constitutes a source term. Thus х, (г, t), and 
hence also (т, £), is a wave emerging from the region U. As our measurement 
consisted in determining whether the electron is in U, this is exactly what one 
would expect according to (3). The essential point is that in the dynamical 
description of the total system the collapse of the wave function for the object 
system alone is due to the fact that the object system is described by a different 
component of the total wave function as soon as the measuring apparatus has 
been set off. 
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However, there is a complication. Rather than a single wave function for the 
electron after the measurement, we have found as many as there are photon 
states k. They belong to mutually orthogonal vectors in the Hilbert space of 
the total system. If F is some operator acting on the electron variables, its 
expectation value after the measurement is 


CE |Е| 9» = (o |F| ф) + X (ук |F| v9). 


where the round brackets indicate integration over the electron coordinates. 
The first term refers to the possibility that the electron has not been observed 
in U. The second term is a sum over all microstates in which the electron has 
been observed, leaving the measuring apparatus in one of the microstates that 
together make up its final macrostate. Hence the electron by itself cannot be 
described by a single wave function but the ensemble of functions s, is needed, 
i.e., a density matrix. This is not an artifact of our model, but a necessary con- 
sequence of the macroscopic nature of the measuring apparatus. Incidentally, 
the y, as functions of r have no reason to be mutually orthogonal or normalized. 

In order to study the properties of this ensemble in more detail take for A(p) 
in (8) a sharply peaked function around some po, so that the incident wave 
packet is virtually a plane wave with well-defined energy E = 5 ро? + Q. 
Let Ar be a rough measure for the diameter of U in the direction po. Then 
Дро = At is the time during which the electron interacts with the apparatus 
and induces a dipole moment in the atom. On the other hand, the decay time 
of the atom is of order (u? 0*)7!, where u is an average strength of the dipole 
moment. 

First suppose that the atom has ample time to decay 


Ar 1 


In that case the frequency k of the emitted photon is equal to 2 (within the 
natural line width). If we therefore substitute 2 for the quantity К in (7b), 
or in (9), it appears that all ү, (т, t) have the same form, apart from the factor v,. 
We denote the common part by y’, 


Pull, t) = №, v'(r, t), 


with a normalization factor N. Then the expectation value of F, conditional 
on the electron having been observed in U, is proportional to 


Y (ук |E] v) = N* È ve) (р |F] v. (10) 


N is determined by the requirement that y’ should be normalized for t — оо, 


№ Y vy? = У, [pH оо) | v. (4-99)] = 1 — [g(4- o) | g(4- 991. 
k k 
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Thus the behavior of the electron after having been observed in U is virtually 
determined by one wave function y': the ensemble mentioned above virtually 
reduces to a pure state. In this case one actually has a collapsed wave func- 
tion y' as anticipated in (3). 

Next suppose that the passage time At = Ar/po is no longer large com- 
pared to the decay time of the atom. Then the photon emission process is 
interrupted after a time At, which according to Lorentz gives rise to a line 
width of the emitted radiation 


Ak ~ 1/At. 
This is reflected in an energy spread of the electron after the measurement: 
| AG p?) At ~ 1, (11a) 
which may also be written in the familiar form 
Ap Ar ~ 1. (11b) 


To be precise, however, one should realize that this is not quite the same 
as Heisenberg's uncertainty relation. His relation is a feature of Fourier trans- 
forms: in order to construct a wave packet confined to a region Ar one needs 
all wavenumbers of an interval Ap. Our relations (11) refer to the spread of p 
values in the ensemble of wave packets p,, due to the fact that the electron has 
exchanged an unspecified amount of energy with the measuring apparatus. 

The upshot is that if the duration of the measurement is At, the ensemble of 
wave functions y, extends over an energy range 1/41, assuming that the energy 
of the incident packet, and hence of the total state Ў is sharp. When Ar is very 
large there still is an ensemble with an energy spread due to the natural line 
width u? 2? of the atom. This may be regarded as a consequence of the 
property of the apparatus of reacting in the time (и? Q?)71, even if the pas- 
sage time Л is longer. 

As a last example let me consider the case of a very short passage time, 


po? ltl, poAr<i. (12) 


That means that U is small compared to the wavelength of the electron. Hence 
we expect to find that there is no correlation between the momenta of the elec- 
tron before and after the observation. Owing to the inequality (12) the equation 
(7b) may be written 


ivy -H G V? — k) y, = iv, u(r) GO, г). 
Substitute for o the unperturbed plane wave 


p(t, t) = exppo.1 —i Et) E=tp?+2. 
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The resulting Born approximation for y, yields 


Uk Ho 


Фит, t) = san {ir [2 (E— 5]? —i Et}, (13) 


zr 


where ио is the space integral of u(r). . 

The time factor simply shows that we are dealing with a solution of the total 
system with a sharply defined energy E. The momentum values that occur in 
the ensenible are given by the energy left over by the photon. АЦ values less 
than (2E)!/? occur with relative probabilities v,?. Thus their probability dis- 
tribution is solely determined by the coupling strength of the various photons. 
A more exact treatment shows that this approximation is justified provided 
that the damping factor in v, vanishes before К reaches the value E; hence no 
imaginary momenta occur in (13). 


The conclusion is that the collapse of the wave function of the object system 
can be understood as a dynamical consequence of its interaction with a macro- 
scopic measuring apparatus. The combined system is in a single pure state at all 
times, but the final state of the object system alone is in general a mixture. The 
reason is that the measuring apparatus ends up in a macrostate, which consists 
of many microstates between which the observer cannot distinguish. 
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Abstract 


In some scattering problems a certain role is played by an integral over 
the pair distribution function of the scattering system. On the other hand, 
the energy of a dilute electron gas may be expressed in terms of a slightly 
different integral over the pair distribution function. The close relation 
between these two integrals is pointed out and the question of the system 
configurations for which these integrals would attain their maximum 
value is discussed. 


1. Introduction 


In this note, dedicated to Dr C. J. Bouwkamp on the occasion of his sixtieth 
birthday, I would like to draw attention to a remarkable relation between two 
apparently quite different subjects in theoretical physics. In each field mentioned 
in the title of this paper a problem has been formulated and investigated in 
detail. However, there is no indication in the literature that the close relation- 
ship between these problems has been recognised. Let me add at once that, as 
far as I know, the problems have not been solved completely and rigorously, 
so that there remains a challenge for people who, like Bouwkamp, combine 
physical insight and mathematical skill. 

The problems can be stated in terms of the well-known pair distribution 
function g(r), which was introduced in the statistical description of a medium 
consisting of a very large number of identical molecules by Ornstein and 
Zernike +) in 1914 and applied to the theory of scattering of X-rays in liquids 
by Zernike and Prins 7) in 1927. It is defined in such a way that o g(r) d?r 
represents the probability of finding in a large system in equilibrium a particle 
in volume element d?r at vector distance r from a given particle, where o is the 
number density (average number of particles per unit volume). In an ideal gas 
g(r) = 1, in dense gases and liquids g(r) will depend only on the distance r 
and will approach 1 for large r. In a crystal g(r) will of course depend on the 
-direction and for an ideal lattice can be represented by a sum of delta functions 
(cf. (9a) below). 

We shall show that in the two fields mentioned a certain role is played by 
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the following integrals: 


d?r 
r= [ü—291— (1a) 
and 
d?r 
J= [1 —(ю1— (1b) 
r 


respectively and the problems referred to above may be expressed as follows. 
If in the integrals J and J we take the cube root of the volume per particle as 
the unit of length, what is the maximum value that J (or J where appropriate) 
can attain? Or, in different words: if we consider systems of identical particles 
of given density, which structure leads to the maximum value of J (or J)? 


2. Scattering cross-section in'a dense medium 


Consider a large system of identical particles (nuclei, atoms, molecules), 
whose positions are given by r,. А monochromatic plane wave, e.g. an electro- 
magnetic wave (visible light, X-ray) or a wave of monoenergetic slow neutrons, 
with incoming wave vector kp is elastically scattered with outgoing wave vector 
k (k = ko). If the scattering amplitude for scattering by one particle is called 
A(0), where 8 is the scattering angle, then the amplitude of the wave scattered 
by the whole system is given by 


A(0) У exp (2% i h . r;). (2) 


The exponential factor takes account of the path differences between waves 
scattered by different particles: 


2 
2% h = ky — k, hc nnde 


The differential scattering cross-section per particle in this "static" scattering 
theory (where the particles are assumed to be fixed and energy transfer to the 
medium is neglected) is then given by 


o(h) = |4(9)? ( 1 + У exp [z ih. (г, — >) 


nei 


an expression which finally should be averaged in configuration space. With the 
above definition of the pair distribution function this leads to 


oh) = |49|? ( +e / [g(r) — 1] exp Qz ih . r) er) (3) 


which is the formula quoted by Zernike and Ргіпѕ 2). The differential scat- 
tering cross-section of a single particle |40)? depends of course on the 
nature of the scattering process. In the case of slow neutron scattering by 
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nuclei 4(0) = a, independent of the scattering angle, where a is the so-called 
coherent scattering length. In the case of scattering of unpolarized light by an 
isotropic fluid, then according to Einstein’s theory 
л? 1+ cos? 0 /de\? 
MOP = ———— — (<= : 
At 2 do 


where e is the dielectric constant at the frequency of light. In this case the 
wavelength A is very large compared with the range of g(r) — 1 (except in 
the neighbourhood of the critical point). As a consequence the exponential 
factor in the integral in (3) can usually be omitted in the case of light scattering. 


(4) 


Hence 
(h) coe (=) (14 fi T ) T 
o(h) = — ————— | — т) — r), 
At 2 do i Е 
while according to a well-known result quoted by Ornstein and Zernike !) 


1+ | i0 — 11 dr = ут, (6) 


where ут is the relative isothermal compressibility of the system, i.e. the iso- 
thermal compressibility divided by that of an ideal gas. 
In the case of scattering of unpolarized X-rays finally 


Yo ( e? ) 1 + cos? 0 RO () 
те? 2 | 
where F(h) is the (atomic) form factor. 

In all cases mentioned the differential cross-section per particle divided by 
the cross-section of a single particle is given by the structure function 


Sh) = 1+ e | 180) — 11 exo (27 i h . r) dr, (8) 


where as in previous formulae the integration is to be extended over all space. 


In the particular case of a Bravais lattice 
g(r) = vo У ô — R,), (9a) 
n#0 


where the summation is over the lattice sites and оо (= 1/0) is the volume of the unit cell. 
Then as a consequence of the well-known relation 


1 
Y exp Qz i h. R) = — У ô(h — hb), 
n Vor 


where the summation over A is over the sites of the reciprocal lattice, we have in this case 


1 
Sh) = — У, d(h—hy), (9b) 
Vo Axo 
which describes the Von Laue-Bragg scattering of X-rays and neutrons. 
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We now wish to determine the total scattering cross-section of a randomly 
oriented system, where for simplicity we restrict ourselves for the moment to 
the case in which |.4(6)|? is independent of the scattering angle 0. Hence the 
following derivation holds without further correction for the case of slow 
neutron scattering only. We must now average over all directions of h and 
integrate over all scattering angles. If we measure the total cross-section in 
units of the total rather than the differential cross-section of a single particle, 
the second integration can also be replaced by an average. It is convenient to 
carry out both averaging processes together. We have thus: 


10, 
4m ` 


big = f i d(cos 0) f Sh) 


From й = (2/4) sin 20 it follows that d(cos 6) = —A? h dh and hence, 
2 2/4 


бш = — | h dh | Sh) dQ, 


dI p ? [n say Ë ios 
pub rua cc (10) 


where the integrals are to be extended over a sphere of radius 2/4 in h-space, 
which we indicate by the suffix B. Making a Fourier transformation to r-space, 
it follows from (8) that we may also write: 


i A? о J TTE 4лг\ d?r i 
cai „=. — cos — н 
E 8л: e ( ш A ) zr? QD 


In the case of light scattering by an isotropic fluid far from critical conditions 
we have seen that S(h) could be replaced by S(0) and as a consequence the 
relative total cross-section is now 


бы = 50) = 1- e f 1809 — 11r = ут (12) 


according to (6), a result which of course could be obtained also from (11) by 
expanding 1 — cos (4zr/A) and retaining only the first term in the expansion. 
On the other hand, in the case of scattering of light by a fluid close to its critical 
point (a topic of current interest) the range of g(r) — 1 becomes very large, 
the relative compressibility ут — oo and the scattering becomes very intense 
(critical opalescence). Then it is no longer permissible to replace the factor 
exp (2x ih . r) by unity and we must go back to (11), or rather to a similar 
but slightly more complicated expression as a consequence of the fact that 
KOK is angle-dependent owing to the presence of the polarization factor 


а + cos? 6)/2 (cf. ед. (4)); see for example Rosenfeld 3). 
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Let us now consider the equivalent expressions (10) and (11) a little more 
closely. Under certain conditions it is permissible to extend the integration in 
(10) over the whole of h-space or — what amounts to the same thing — to 
leave out the term cos (4zr/A) in (11). This holds in the case of slow neutron 
scattering for short wavelengths (ie. 2 5 1 A). This is because for large Л 
(i.e. 2/2 ѕіп 30 small compared to interatomic distances) interference effects 
become negligible and o(h) tends to unity *). It also holds in the case of light 
scattering close to critical conditions where g(r) — 1 has a very long range 
and changes slowly over distances comparable to 4. 

In these cases we have, asymptotically, 

d? 


gO. (13) 
Tr 


(оү = 1— 


For critical light scattering we may neglect 1 compared to the second term апа 
we notice an important change in wavelength dependence of the total scattering 
cross-section. Under ordinary circumstances it is inversely proportional to А“ 
(cf. eq. (4)); in the neighbourhood of the critical point this changes into a A~? 
law. This effect had already been noted by Ornstein and Zernike +). 

I have now explained why Placzek, Nijboer and Van Hove ^) in their paper 
on neutron scattering were led to investigate the properties of the integral 
(remember that 1/h is the Fourier transform of 1/zr?) 


d?r d? 
r= ] й—@—;= f 0 — st. (14) 


mentioned in the introduction, where now we shall again use the cube root of 
the volume per particle as the unit of length. Its value was discussed for dense 
media, i.e. for ideal crystals, for thermally excited crystals and for liquids. The 
discussion was based on the following mathematical transformation: 

Let us split J into two parts: I = J, + I, where 


exp (—лг?) 

I, = [t —891— tr, 
15 
— exp CAES п) 

n= [t1 — 
iF 
Transforming J, into h-space, we have with (8) and noticing that 
1 — exp(—ar? Ф(л1/2 h 
[ RET урата 207, (16) 
лг 


*) For crystals, this statement must be qualified, see ref. 4. 
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where 
D(x) = 22719 | exp (—t?) dt 


x 
is the error function, 


p (~x 


1/2], 
I= [u-n 8 702 A go 


dèr + f n — s —— 


— 1/2 }, 
=3— [ go PS a, -f S(h) а TM | a 


Since, as is evident from their physical significance, g(r) and S(h) are non- 
negative, (17) gives an upper bound for 7: J < 3. Moreover the deviation 
from the upper bound 3 may be expected to be small for dense media. This 
is because firstly g(r) = 0 for r < a where a is the distance of closest approach 
(which is of the order of 1 in our units), and secondly S(O) = xr (cf. (12)) 
and the relative compressibility is small for solids and for liquids far from their 
critical point. On the other hand it is obvious that J is zero for an ideal gas 
and further that it will become strongly negative for a fluid under critical con- 
ditions. 

The integral J was calculated for a large number of ideal lattices. This would 
have been extremely difficult from (14), but from (17) it is easy because the 
series (for lattices) in (17) converge very rapidly. 

I shall now quote a few results in table I. 


TABLE I 
lattice 1 
cubic face-centred 2-888462 
hexag. close-packed 2:888377 
cubic body-centred 2-888282 
simple cubic 2-837298 
diamond 2-693400 


In table I the lattices are arranged in order of decreasing closeness of packing 
(the upper two are both close-packed); the value of J decreases in the same 
order. It was also shown that thermal vibrations lead to a small decrease from 
the value of 7 for the corresponding ideal lattice and furthermore the value of 7 
for liquids under ordinary circumstances was estimated. The question of what 
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the maximum value of 7 would be and for what system it would be reached is 
raised in the paper *) mentioned above. I will not repeat the discussion given 
there; it was thoughf to be a rather difficult problem. Though it seemed likely 
that the value 2-888462 for the face-centred cubic lattice is the maximum pos- 
sible value, a rigorous proof could not be given. 


3. Energy of a dilute electron gas 


In the elementary electron theory of solids it is assumed that the valence elec- 
trons move independently in a periodic field. Attempts to take into account the 
interaction between the electrons in a more rigorous way originated in the work 
of Wigner ?). The problem is often simplified by neglecting the periodic struc- 
ture of the system and considering an electron gas moving in a uniform positively 
charged background in order to make the system electrically neutral. The elec- 
tron density о is usually expressed in terms of the dimensionless quantity r, 
defined by 


= $ n r? ao, (18) 


where ао = Й?[те? is the Bohr radius; i.e. r, is the radius (expressed in atomic 
units) of a sphere with volume equal to the volume per electron. Small r, means 
high density, large r, low density; for real metals r, is of the order of 5. 

In modern many-body theory refined perturbation methods have been suc- 
cessfully used in treating the interaction between the electrons. These lead to an 
expansion for the energy of an electron gas for small values of r, i.e. it is essen- 
tially a high-density expansion. In order to develop interpolation formulae for 
the case of realistic r,-values it is obviously useful to consider the low-density 
limit also, i.e. the case of a dilute electron gas (large r,). Wigner in his paper 
referred to above argued that in the limit of large r,-values the kinetic energy 
of the electrons, which is proportional to r,~?, could be neglected in com- 
parison with the potential energy (proportional to ғ; !) and he suggested that 
in a dilute electron gas the electrons would arrange themselves in a configuration 
of minimum potential energy, "probably a body-centered cubic lattice". 
Fuchs 9) performed an accurate calculation of the potential energy of an elec- 
tron lattice for the body-centred and face-centred cubic lattices and he found 
that the b.c.c. structure was indeed the more stable one, though the difference 
turned. out to be very small. 

Here I wish to point out that the potential energy of an electron gas in a 
neutralizing positively charged background can be expressed in terms of the 
integral J mentioned in the introduction (cf. (1b)). Indeed if the structure of 
the electron system is given by the pair distribution function g(r) then the 
potential energy per electron of such a system is obviously given by 
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where the cubic root of the volume per particle is used again as unit of length 
in the integral in the second line. 
From (92), (9b) and (14) it will be evident that the value of the integral 
3 


d 
72 |u- 


for a certain Bravais lattice may be found immediately from the value of J, if 
the latter is calculated for the reciprocal lattice. Indeed the function g(r) for a 
certain lattice has the same form (namely a sum of delta functions) as the 
function S(h) for the lattice reciprocal to it. Remember further that the recip- 
rocal lattice of a simple cubic lattice is again simple cubic, that of an fcc 
lattice is bcc and vice versa. 

In 1960 Coldwell-Horsfall and Maradudin ?) evaluated the potential energy 
of the three primitive cubic electron lattices in much the same way (though ex- 
pressed in less general mathematical language) as we calculated 7 in 1951 (and 
which I reproduced in sec. 2), apparently without recognizing the close relation 
between the two problems. I should add that we in our turn were not aware of 
the relevancy for our problem of Fuchs' calculation. 

Their results were (in Rydberg units e?/2ag): ' 


1 
Ey = —1:791860 — , 
rs 


1 
Еке = —1:791753 — , 


rs 


E. = 1.760119 — . 
: г, 

From the remarks made above it will be clear that these numbers might have 
been obtained simply from the numbers given in table I for the fcc, bcc and 
simple cubic lattices (in that order!) by dividing them by (4z/3)!/? = 1-611992. 
(Let me add that in addition to these results the authors mentioned ?) also 
calculated the correction (proportional to ғ; 3/2) to the energy due to the zero- 
point vibrations around the lattice positions.) 

We have seen that Iree > е, whereas Jg, < Jy. Noticing that the 
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Fourier transform of 73/2 is just h-?/? we may infer that the integral 


d?r 
[E-e 
has the same value for the fcc and bce structures. 

It was stated in sec. 2 that the question as to which structure (i.e. which 
function g(r)) would lead to the absolute maximum of J has not yet been solved 
rigorously. The same holds, as far as I know, with respect to the question of 
which structure of an electron gas in a compensating background has an abso- 
lute minimum of the potential energy (or a maximum of J), though a lattice 
structure would seem intuitively likely here. Whether a final solution to one 
of these problems would also solve the other one would seem to depend on 
the question of whether the functions g(r) and S(h) (both non-negative and 
S — 1 the Fourier transform of g — 1) also in more general cases than for 
Bravais lattices belong to the same class of functions. Or stated more explicitly: 
if g(r) is the pair distribution function of a certain system and S(h) is according 
to (8) the structure function corresponding to it, is it then always possible to 
consider S(r) as the pair distribution function of a “reciprocal” system? For 
the moment I would like to leave this question open. 
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1. Introduction 


Let us consider a medium of dielectric constant eo, magnetic permeability шо 
and electrical conductivity со containing a suspension of particles of different 
electromagnetic properties. We wish to calculate є*, и* and с“, the effective or 
bulk properties of the composite medium for static electromagnetic fields. We 
shall do so provided that the concentration of particles is so small that electro- 
magnetic interaction between them is negligible. Thus our result will be valid 
only up to and including terms of first order in the concentration. However, 
no restriction will be placed upon the electromagnetic properties of the par- 
ticles, so that they may be very different from those of the surrounding medium. 
Furthermore all the parameters of the medium and the particles may be tensors. 
The method to be used is that given by Landau and Lifshitz +) for calculating 
o*, An extension to higher concentrations is given in sec. 5. 

Corresponding results for high concentrations of perfectly conducting spheres 
or cylinders arranged in a cubic or square lattice have been given by the author ?) 
and numerical results for arbitrary concentrations by Keller and Sachs ?). 
When the media inside and outside the cylinders in a square lattice are inter- 
changed, the effective conductivity changes in a manner described by the 
author ^). 


2. Derivation of general expressions for є* and o* 


We begin by considering a large region .R of the composite medium having 
volume V and containing N particles. Let the region R be immersed in a uniform 
static electric field Е, and let the resulting electric field in R be E(x). Then the 
displacement D(x) and the current J(x) are given by 


D = г(х) Е (1) 
and 

J = o(x) E. (2) 
Here &(x) and o(x) are the dielectric constant and conductivity at the point x 


*) Research. supported in part by the Office of Naval Research under Contract No. 
N00014-67-A-0467-0015. 
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in the composite medium. Let 


f= ү-* | то) ах 


denote the average over R of any function f (x). Then we define є* and o* by 
the relations 


D = г* Е : (3) 
and 
J—c*E. (4) 
To determine &* we note that 


D— so E = V-! | ф—Е)дх 
R 


=V-! [ (е s) E dx 
/ 


N 


=y-} MIS (5) 


1=1 Rt 


In (5) we have used the fact that € = =, except within the regions Кү, i = І, 
‚..› N, occupied by the particles. Now if (3) is used for D, we can write (5) as 


e*—9E- nv C [eso Bax). (6) 
Here " 
(6) = М У а 


denotes the average of a function over the particles. 
Let us define p, the average polarizability tensor of a particle, by the equation 


pE={ ева) (7) 


Ri 
Then (6) and (7) yield the result 
et = eo + N V- p. (8) 
In exactly the same way we find 


of = 0o + N Y-1q. (9) 
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Here q, the average conductivity tensor of a particle, is defined by 


gE =( [ @—«) Bax), (10) 


If the region R is immersed in a uniform static magnetic field Не, we denote 
by H(x) the resulting magnetic field. Then the magnetic induction B(x) is given 
by B = u(x) Н and p* is defined by B = u* H. Proceeding as above we find 


и = uo + NVA m. (11) 


Here the average magnetizability tensor m of a particle is defined by 


mH =( одна). (12) 


Ri 


The results (8), (9) and (11) for e*, o* and и* together with the definitions 
(7), (10) and (12) of p, g and m are our general results. They show that the 
difference between any effective property of the composite medium and the 
corresponding property of the original medium is proportional to the number 
density of suspended particles. Its coefficient depends upon the entire particle 
distribution in general. However, it is characteristic of a single particle when 
the number density is small enough. This is the case to be considered in the 
next two sections. 


3. Determination of p, q and m 


To determine p and q we must consider a medium of properties eo, со and uo 
with a uniform applied electric field Ey and find the resulting field E(x). To do so 
we let с(х) and (x) denote the conductivity and dielectric constant of the par- 
ticle at the point x, if x is in R,, and of the surrounding medium if x is out- 
side R,. Since V . J = 0 for a static field, it follows from (2) that V . (о E) = 0. 
In addition V XE = 0 so E = V ф where the potential ф is a scalar. There- 
fore 


V.(cV9g)-0. (13) 


We recall that the tangential component of E and the normal component of J 
are continuous across surfaces of discontinuity of the medium (i.e. of o(x) and 
e(x)). This implies that ф and the normal component of o V ф are continuous 
across discontinuity surfaces. At infinity y must tend to the external potential 
Eo . x. Equation (13), these continuity conditions and the condition at in- 
finity suffice to determine g, and therefore E, provided o(x) 4 0. Then (7) 
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and (10) yield p and g. The additional Maxwell equation V . (s E) = o deter- 
mines the charge density o(x). 

If o(x) = 0 within any region Ro, (13) is identically satisfied there. In such 
a region g(x) can be prescribed arbitrarily and the equation V . (e V p) = о(х) 
must hold there. When о(х) = 0 in Ке this becomes 


V.(eV9)—0. | (14) 


Equation (14) in Ro and (13) outside Ro, together with the continuity condi- 
tions and the condition at infinity, serve to determine ф and therefore E. Then 
(7) and (10) yield р and q. If o(x) = 0 and o(x) = 0 everywhere, then Ке is 
the entire space and (14) holds everywhere. 

To determine m we consider a single particle in a medium of properties ге, 
со and ро with a uniform applied magnetic field Ho. To find H(x) we utilize 
the static equation V XH = 0 to write Н = V y where y is a scalar. Then 
the equation V . B = 0 yields V . Н = 0 from which we obtain 


V «(a V y) = 0. (15) 


The continuity of the tangential component of H and the normal component 
of B at discontinuities implies that y and the normal component of u V + 
are continuous. At infinity y must tend to Hy. x. These conditions and (15) 
determine y and thus H(x). Then (12) yields m. 

From the above considerations the following simple theorems result at once. 


Theorem 1. If о(х) = о &(х), where х is a constant, then 4 = ap and 
o* = ge", 

Proof. Comparison of (7) and (10) yields the first part, while (8) and 
(9) yield the second part. 


Theorem 2. ЇЇ o(x) = а u(x), where œ is a constant, then q = х m and 


o* = а u*. 

Proof. Comparison of (13) and (15) shows that p = vy if Ey = Ho. 
Then (10) and (12) show that q = « т. Finally (9) and (11) yield 
o* —ay*. 


Theorem 3. Let &,(x) and o,(x) be the properties of a particle in a medium 
and e;(x), с,(х) those of a different type of particle in another 
medium. Let 29 = 0 and ос,(х) = х е, (х). Then 4 = « Pı 
and о›* = a €,*. 

Proof. In medium one, o, satisfies (14) with г = ¢,, while in medium 
two gz satisfies (13) with о = cz. By the hypothesis these equa- 
tions are identical and therefore so are the solutions. Thus 
pı = p2 and E, = E;, so (0 and (10) yield q2 = «р, while 
(8) and (9) yield с,* = «є,*. 
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4. Spherically symmetric particles 


Let us assume that o(x) is a scalar depending only upon r, the distance from 
the centre of a particle. Then (13) becomes 


4g + 07! о, p, = 0. (16) 


Let us choose the direction of the applied field Е, to be the z-axis and let the 
magnitude of E, be unity. Then y must tend to z as r becomes infinite. In polar 
coordinates ғ, 6, р with the z-axis as the polar axis, we have z = r cos 0, so 


9 ~rcosé at r=. (17) 
To solve (16) and (17) we set 9 = f'(r) cos 0. Then (16) and (17) become 
f° +@r1t+67106,)f'—2r-7f=0, (18) 


fr at r= oo. (19) 


At discontinuities of o(r), if any, the continuity of y and o ф„ require that f 
and o f' be continuous. 

In terms of f we can evaluate q from (10). By symmetry it is clear that only 
the z-component of the integral in (10) is not zero. Then since the z-component 
of E is p, = p, cos 0 — r^! фо sin 0, (10) yields 


eo л 
g= 2л] | [o(r) — со] [f cos? 0 + r7! fsin? 0] sin 8 d8 r? dr. (20) 
0 о 
Integration with respect to Ó gives 
4m p 7 
= | lo) ол) + 2rF ar. Q1) 
о 
If а(х) is also a scalar function of ғ, (7) shows that p is a scalar given by 


4л p 
p= — |180) — eem + 2rF Ol ar. Q2) 
о 


Let us now consider some examples. 


Example I. A conducting sphere in a conducting medium 


Consider a spherical particle of radius a and constant conductivity о, in a 
medium of conductivity со. From (18), (19) and the continuity conditions we 
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— have 
f(r) = 309 (200 + о): for O<r<a 
and 
fy=r+@(o—0;)Qo9+0;)71r-? for r>a. 
Thus (21) and (22) yield 
4л a? 309 (с; — со) 


= 23 
3 209 + о, Q») 
and 
12 T Oo j 
p =—— | [e(r) — £o] r? dr. (24) 
20g + о, - 
If e(r) is a constant e, inside the particle, (24) becomes 
4z a? 30o (&, — Eo) 
p= —2————— (25) 


3 209 + сі 
If u = ш, for r < a апа и = цо for r >a, where u, and де are constants, 
а similar calculation, or the use of theorem 2, yields 
ма 4л а? 3uo (Hı — Ho) 
3 2uo + Mı 


The result for o* when (23) is used in (9) was obtained by Landau and 
Lifshitz +). 


(26) 


Example ЇЇ. A conducting particle in a non-conducting medium 


Suppose that a spherical particle of radius a and constant conductivity с; is 
surrounded by a medium of conductivity со = 0 and constant dielectric con- 
stant £g. Then (13) holds for r < a and (14) holds for r > a. At the interface p 
and c y, must be continuous. We may still seek y in the form Фф = f(r) cos 0. 
Then (13) shows that f satisfies (18) for r < a, and since с; is constant, o,, = 0. 
For r > a, (14) shows that f satisfies (18) with o replaced by £o, and £o, = 0. 
From these equations and continuity conditions, as well as (19), it follows that 
f(r) = 0 for к< a and f(r) = r—a? r-? for r > a. Then (21) and (22) yield 
q = 0 and p = 0. Then (8) becomes e* = & + o(N V-*) while (9) becomes 
о* = 0 + (N V7). 

The results £* = ey and o* = 0 are exact in this case for all values of N V7! 
provided that particles in contact do not form a path through the entire region. 
This follows because, from (13), y is harmonic inside the particle, while from 
the continuity of o ф,, 9, = 0 at the surface. Thus ф = constant inside the 
particle so E = 0 there. Then (5) shows that є* = ғ, and similarly o* = 0. 
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These results are independent of the shapes of the particles and are also true 
when o is not constant inside them, provided it is not zero. We state this result 
as theorem 4. 


Theorem 4. For perfectly conducting particles in a non-conducting medium, 
e* = в, and o* = 0 provided that particles in contact do not 
form a path across the entire region. 


The conclusion that e* = г for conducting particles in a non-conducting 
medium is in disagreement with the well-known results for this case. This is 
because those results are based upon a different definition of &* from that which 
we have used. It suggests that our definition may not be so useful in this case. 
However, we can recover the usual results by putting o = 0, considering the 
particle to have dielectric constant ¢,, and then letting є, tend to infinity. 
This is shown in sec. 4, example IV. 


Example YII. A conducting sphere surrounded by a spherical layer іп a con- 
ducting medium 


Let us consider a spherical particle of radius a having conductivity о» for 
0 « r < b and conductivity c, for b < r < a, in a medium of conductivity 
Со, Where со, с; and c; are constants. The solution of (18) and (19) for this 
case, subject to continuity of f and o f’, is 


Зо, fa\> 
f= (5 у!» O<r<b; (27) 
05 —90, \b 
2 с 
f= (22, a? у, b<r<a; (28) 
2с, + aN? 
freni eyra, a<r; (29) 
05—060; \b 
where 
a 37-1 
y = 300 (02—01) E (62 — о) (01 — 09) + (200 + 01) 00, +02) (5) ] | 
(30) 


By using this solution in (21) we obtain 


4z a? 2 3с; (05 — 00) + (20; + 02) (à? b^? — 1) (o, — оо) 


о, (31) 
3 2 (0; — оу) (c, — 0o) + (200 + о) Qo, + 02) à? b7? 


q= 


If e = e, for 0 < r < b, € = e, for b <r < a and е = £ for r >a, where 
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£g, €, and £, are constants, (22) yields 


4л a? 3o, (€2 — £o) + (20, + сг) (a? b^? — 1) (e, — ёо) 


—— $gg ———————————————————————————. (32) 
3 2 (02 — 01) (т, — 00) + Qao + 01) (20; + G2) а? b? 


p= 


If и = H2, fy and fo in the regions 0 <r <b, b<r<a and r a re- 
spectively, where the ш; are constants, then m is*given by the right side of (31) 
with с; replaced by ш. 

Example ТУ. A non-conducting sphere in a non-conducting medium 


Now we consider a spherical particle of radius a with dielectric constant €, 
in a medium of dielectric constant ¢9, with с = 0 both inside and outside the 
particle. By applying theorem 3 to the result (23) we obtain 


4л à? 3&9 (=, — ёо) 


“3 260 + € 


p= (33) 


As &,/& — оо we see that р — 4л a? в, which is the polarizability of a con- 
ducting sphere. 


5. Extension to higher concentrations 


The results (8), (9) and (11) can be extended to higher concentrations by 
supposing that each particle is surrounded by the effective medium, rather than 
by the original medium. Thus in (8) р = p(éo) depends upon єо, so we replace 
it by p(e*). Then (8) becomes the following equation, which is to be solved 
for s*: 


&* = в, + N V-! p(e*). (34) 
Similarly, (9) and (11) become equations to be solved for o* and u* respectively. 
As an example, for non-conducting spheres in a non-conducting medium, 
we use (33) in (34) to get 
4л a? N 3e* (e, — є*) 


35 
3V 2&* - в, (85) 


£* = бо 


This is a quadratic equation for e*, the positive root of which is to be used. 
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Abstract 


Bounds are obtained for the cardinality of sets of lines having a pre- 
scribed number of angles, both in real and in complex Euclidean п-ѕрасе. 
Extremal sets provide combinatorial configurations with a particular 
algebraic structure, such as association schemes and regular two-graphs. 
The bounds are derived by use of matrix techniques and the addition 
formula for Jacobi polynomials. 


1. Introduction 


We consider sets of lines in real and in complex Euclidean n-space having a 
prescribed number of angles. In the case of one angle, two types of bounds are 
known for the cardinality of such sets: one in terms of the angle and the 
dimension (cf. refs 13 and 12), the other in terms of the dimension only (the 
Gerzon-bound, cf. ref. 12). In the present paper both types of bounds are 
generalized. The special bound (cf. table T) uses the values of the admitted 
angles. The absolute bound (cf. table IT) uses the number of such angles, not 
their values. 

The essential tool in obtaining these results is the addition formula for Jacobi 
polynomials. The classical addition formula for Gegenbauer polynomials (cf. 
ref. 9), was recently generalized to Jacobi polynomials by Koornwinder 19:11), 
In certain linear spaces of harmonic polynomials this formula is interpreted as 
a (Hermitean) inner product. Sets of vectors on the unit sphere in R^, and in C", 
are characterized in terms of the matrices of their inner products. Thus, prop- 
erties of such sets are related to properties of Jacobi polynomials, and the 
techniques of refs 7 and 8 may be used. 

Of particular interest are the sets of lines whose cardinality equals a bound. 
In the case of one angle these sets are regular two-graphs 15:15); sometimes, 
they provide a combinatorial setting for interesting simple groups. Also in the 
general case the extremal sets give rise to combinatorial configurations with 
interesting algebraic properties, such as association schemes ^9). ^ 
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The cases В” and C" are treated separately in secs 2 and 3, and simultaneously 
from sec. 4 on. The two families of Jacobi polynomials 


{Qo,2(); Q1,.(%); Q2, (X), “ж :ds 


for в = 0 and for e = 1, are defined by recurrence relations in sec. 2. The 
value Q,,.(1) equals the dimension of the space Harm of the harmonic poly- 
nomials in n variables of the corresponding degrees. In theorem 3.3 the addi- 
tion formula is stated in terms of the Jacobi polynomials and an orthogonal 
basis of Harm. 

Let А be a finite subset of the interval [0, 1[, and let X be any finite subset 
of the unit sphere 2, having the property that |(£, 7)|? belongs to A for all 
E x q € X. In sec. 4 the characteristic matrices H,,, are defined from X and 
an orthonormal basis of Harm. The crucial theorem 4.4 yields an inequality 
for |Х | i in terms of the Jacobi polynomials, the characteristic matrices, and a 
polynomial F(x) which behaves suitably for any хє 4. This theorem is ap- 
plied in sec. 5 to the annihilator polynomial of the set А, yielding the special 
bounds of theorem 5.2 and table I. In sec. 6 the characteristic matrices Ho,., 
Нуз Henkes with |a] = s, are combined into the matrix H,, and ap- 
plication of theorem 4.4 yields the absolute bounds of theorem 6.1 and table II. 
Several examples are given, such as those related to the simple groups of 
Conway ^) and Rudvalis, cf. ref. 5. 

In the final section 7 the linear spaces А and A, are defined. A sufficient 
condition for these spaces to be algebras is given in theorem 7.4, which applies 
if the special bound and if the absolute bound is achieved, in theorems 7.5 
and 7.6, respectively. 


2. Jacobi polynomials 


For each of the cases R” and C", with n > 2, and for ee (0, 1}, we define 
the family {Qo, (x), Q1,-(x), . . .) of polynomials Q,,,(x) in one real variable x. 
These are Jacobi polynomials, and share certain properties. We take 0/0 = 0 
and 09 — 1. 


кли 2.1. For єє {0,1} and integer k > 0, the polynomials Ок„(х) are 
defined by the recurrence relations 


Akt Qr 1,0(X) = x 06100) — (1 — Ax) Qx,o(X); 
йк+1 Quai a0) = Qux) — (1— us) Qr), 
with the initial values Q_,,.(x) = 0, Qo,o(x) = 1. 


For the case R”, the coefficients are given by 


2k 2k +1 


p ——— M 
5 n+4k—2 m n 4- 4k 
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The first polynomials are 


Qo,o(x) = 1, 


(n + 2)(пх— 1) 


Q1,0(x) = = == 


n (п + 6) (n + 2) (п + 4) x2 — 6 (n + 2) x + 3) 
22,08) = —— 


24 
Qo,1(x) = п, 
Q, лб) = п 4n 2)х—3) : 


6 


n (n + 2) (n + 8) (n+ 4) (n 4- 6) x? — 10 (n + 4) x + 15) 


Q2,1(x) = 120 


It follows that Qo,o(1) = 1 and, for k+ e z 1, 


dco Geg 


Q«,«(1) = ( nos 


n—i 
For the case C", the coefficients are given by 


k k+1 
КЕИ ОШ, WES 


The first polynomials are 
Оо,о(х) = 1, 
Qo) = (n +1) (1x— 1), 
n (n + 3) (n + D) (n + 2) x? —4 (n + 1) x + 2) 


Q2,0(x) = 4 
Оо„(х) = n, 
0, (х) = ад Дааа А 
2 
п (n + 1) (п + 4) (n + 2) (n + 3) х®— 6 (n + 2) x + 6) 
Q20) = ASS . 


12 
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It follows that 


n+k+e—l1\ fn+k—1 n+k+e—2\ /n+k—2 
Q,, (1) = S . 
n—i n—1 n—i n—1 
Remark 2.2. The present polynomials Q,,.(x) are related to the Jacobi poly- 
nomials С,(р, 9, x) as follows, cf. ref. 1: 


Г(е + 4n + 2k) 
for R": E = Det2k — 64 4(n — 2), 4, x) 
Or Ox, (x) Tn Pe + 2k + 1) (e + (п ) €+4, x) 
Г 2k 
for С": О, (х) = егез G,(s +п— 1, є +1, x). 


Г@т)Г(К 4- 1) Г(в 4- k -- 1) 
For R”, they are related to the Gegenbauer polynomials C, O(x) by 
I'(e + 1 + 2k) I'(m) 


xë Ge + т, € + 3, x?) = 2 I( + m+ 2k) Cerar O). 


We shall refer to the polynomials О, „(х) as Jacobi polynomials. The fol- 
lowing theorem holds for both R" and C", and is a consequence of the defi- 
nitions and of remark 2.2. 


Theorem 2.3. For each e e (0, 1}, the polynomials Qo, (х), Q1,c(%),---» Ок,:(х) 
form a basis for the linear space of all polynomials of degree 
< К. This basis is orthogonal on [0, 1] under a suitable weight 
function. The coefficients in 


f+jte 


x? Qi (x) Qj, (x) = У qw, k) О,.о(х) 
к= 0 
satisfy, with Kronecker 6, ;, 
qo(0, /, k) = бк; 441], 0) = Ql) б). 
3. Addition formulae 
In В", with n > 2, let о, denote the volume of the unit sphere Qu. 


Definition 3.1. Harm (J) is the linear space of all harmonic functions on ©, 
which are represented by homogeneous polynomials of degree / 
in n variables. 


It is well known ?) that Harm (/) has the dimension 


x-no- (21-071) fo 121, М@)=1. 


n—i n—1 
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In C", with n > 2, endowed with the Hermitean inner product, let о, de- 
note the volume of the unit sphere Q,. Let 


S(y;z) = SO. ++, Уп) 2,6, Zn) 


denote any polynomial in 2n variables, with complex coefficients, homogeneous 
of degree / in y,, .. ., Yn homogeneous of degree k in z,, ..., 2,. It is called 
harmonic if it satisfies the condition 


n 


2 
= 0. 
Oy, Oz, 


iz1 


Definition 3.2. Harm (/, k) is the linear space of all functions S on 2, defined 
by 
Veen, (SCE) :— S(6;5)), 


where S(y;z) is a harmonic polynomial, homogeneous in y;z 
of degrees /;k. 
It is well known +°) that Harm (/, k) has the dimension 


nd l—1N/n--k—1 п 1—2N (n+k—2 
N = №1, k) = — ) А 
п — 1 n—1 n—i n—1 
Theorem 3.3. For each єє {0,1}, and integer k >> 0, the linear spaces 
Harm (2k +- e) in the case R”; Harm (k -+ =, К) in the case C" 
have the dimension N = О, „(1). For any orthogonal basis 


(51, S2, ..., Sy) of these spaces, with norm (S) = Мо, the 
following addition formula holds: 


Venem ( У 8,0) 50) = (5, )* О, (6, D). 


t=1 


In view of remark 2.2, this formula coincides in the real cases with the classical 
addition formula for Gegenbauer polynomials, cf. ref. 9, In the complex case, 
this formula is a special case of the addition formula for Jacobi polynomials, 
which was recently obtained by Koornwinder 1011). | 


4. Characteristic matrices 


From now on, we treat the real and the complex cases simultaneously. Let 
X denote a finite nonempty subset of the unit sphere Q,, of cardinality |x | =: 0. 
For any fixed labelling of Y, for ce {0, 1}, for integer k > 0, and for any 
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orthogonal basis (S,, $2, ..., Sy) with norm (S) = Vos of Harm (2k + е), 
and Harm (k + e K), respectively, we define the matrix Hp, as follows: 


Definition 4.1. The characteristic matrix Ну, of size vx N, with N = О, (1), 
is the matrix 


H; = [S8], &єХ, te {1, 2,55 №). 


Thus, each column of H,, consists of the values taken on the vectors of X by 
the corresponding basis polynomial. Without loss of generality, we let Ноо be 
the all-one vector, which we denote by u. We use the following notations. For 
any matrix M its conjugate transposed is denoted by M, and its norm by 
|м || = (tr MM)*/2, The matrix A,,, denotes the zero matrix 0 for k % 1, 
and the unit matrix J for k = 1. 


Lemma 4.2. Hy, Ñr, = KE, M° О, (E D]; 


|Ak. HP = У [63 n) Q.,«|(5. |?) Q, |. m|; 
nex 


Ane Hy,,— v Д? = ||Й B, —%? Q;,.(1). 
Proof. The first identity is a direct consequence of the addition formula of 
theorem 3.3; the second follows from the first one by straightforward verifica- 
' tion; the third one uses 

|| a — v r|P = || m|| — 20 oa] + v |Д. 
We now approach the crucial theorem 4.4. For any ee (0, 1) we associate 


to any polynomial F(x) e R[x] its expansion in the basis of the Jacobi poly- 
nomials О, (х): 


Е(х) = у Лх Ox, (x). 


Definition 4.3. For any integer і >> 0, the polynomial F(x) is called (i, &)- 
compatible with the set Y C (2, whenever 


ee (Ie à Onde DD FUE DP) < o). 


Theorem 4.4. For any ee (0, 1) and integer i > 0, any polynomial F(x) which 
is (i, e)-compatible with a set X C О, of cardinality v satisfies 


v QD) G0) ол) > Y fis Ё. Нь. v Au]. 
k=0 
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Moreover, equality holds if and only if 
Vener (Ie ое rd p» = о), 


Proof. Consider the sum 


Y= У E m| 0,16, D Е, op. 


nex 


Since F(x) is (i, e)-compatible with X we have 


È € Qi, (1) FQ) = v 0, (1) FQ). 


eX 


On the other hand, expansion of F(x) in {Q,,,(x)} and application of theorem 
4.2 yields 


у=}, У [602° o.qQ 0 cE DD See 


k=0 ¢,neX 


© 
= Y fus £s н, |? 
k=0 


= Л 1. Н. v Aral? + 0? fie 9,0. 


k=0 


This implies both assertions. 


+ite 


Lemma 4.5. ||Й„Н„„—® 4, ||? = È geij, K) || н, ||, 


where и is the all-one vector, and q,(i j, k) are as in theorem 2.3. 
Proof. Using lemma 4.2 and theorem 2.3 we observe that 


itj+e 
|Ë, н? m 2 4-0, j, k) У О,.0(|0, |, 
k=0 у 4 


|l oll = p Or, (lE, NID. 


Again by 4.2 and 2.3, this readily leads to the assertion. 
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5. Special bounds for A-sets 


Let A = (a, a, . . ., e) denote a finite set of s > 1 distinct real numbers, 
with 0 < о «1 • 


Definition 5.1. A finite nonempty set Х С (2, is an A-set whenever 
A= ((& mp: £z nex). 

Theorem 5.2. For any єє (0, 1}, let F(x) be a polynomial satisfying 
Vaea (0° F(x) <0); Visi (fi. 20); Љ,. > 0, 
for the coefficients f,,, in its expansion in О, „(х). Then 
|X| <F()/fo,e 


Proof. Since Ов, is a positive constant, F(x) is (0, e)-compatible with X. Now 
apply theorem 4.4, and use the hypothesis. 


Remark 5.3. In the case of equality the second part of theorem 4.4 implies that 
Vaea (a* F(a) = 0), Ver (Sire > 0) = (ü Hr, = 0). 


This is useful for a discussion of A-sets achieving the bound. We shall not 
pursue a complete discussion in the present paper. However, we refer to sec. 7, 
in particular theorem 7.5. 


We now make theorem 5.2 explicit by special choices for e and F(x), de- 
pending on A. Call A* := A\{0}. 


Definition 5.4. Type (A) equals 1 for Оє A, and 0 for 0 є A. 


Definition 5.5. The annihilator of A is the polynomial 


x—a х—@ 

=| | =| | , With €= type (A). 
1—@ 1—@ 

. aed 


acA" 


Thus, the annihilator of A is the polynomial of degree s — e which vanishes 
for all « e A*, and takes the value 1 for x = 1. We now apply theorem 5.2 
for є = type (A) and F(x) the annihilator of A. This yields a bound v(A), say, 
for v — |Х | provided fo, > 0 and all fke 2 0 in the expansion of F(x). 
Table I contains the results, both for R” and for С", for s = 1, 5 = 2, and 
s == 3 with 0 € A. The validity of v << v(A) depends on two conditions: 

(i) the denominator of v(A) should be positive, 

(ii) « + B should not exceed a certain value, К say, for s = 2, 3. 

The results of the table are valid for all such values of 0 Sa < 1,0 « B « 1, 
hence also for f = 0 and for « = f. 


- 
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TABLE I 


Special bounds 
field 004) 


n (1 — o) 


1—n« 


R,C | {a} 


R (o. В} n(n + 2) (1 — а) (1 — В) 
3— (n+ 2) (a B) - n(n + 2) «В 


n (n + 1)(1—9) 0 — B) 
2— (n+ 1) (6+ 8) --n(n + 1) В 

n (п + 2) (n + 4) (1 — о) (1— p) 
15— 3 (n-- 4) (a B) + (0+2) (n- 4) B 

n (n + 1) (0+ 2) (1 —99 (1— 8) 
6—2 (n + 2) (+ B) + (n + 1) (и + 2) 6 


С {а, В} 


R (0, æ, 8} 


С (0, о, В} 


Remark 5.6. The annihilator of A needs not be the best choice for F(x). This 
is illustrated for s = 1 and s = 2 in the case R". Let A = {a} and B = (a, f) 
with 0 < f < a < 1/n. Then for B the choices e = 0 and F(x) = x — a satisfy 
the hypothesis of theorem 5.2, yielding v < n (1 — «)/(1 — по), which is better 
than v(B) of table I, for 0 < 8 < a < 1/(п + 2). Conversely, in case of A-sets, 
v(B) with any suitable f yields a better value than v(A) for « > 1/(n + 2). The 
limit value х = 1/(n + 2) yields v(4) = łn (n + 1) = v(B), for every f. In 
fact, 4n (n + 1) is the absolute bound of table II. 


Example 5.7. In the case R", A = (a), table I yields 


n 234 5 6 7 15 19 20 21 22 23 
1 459 9 9 9 25 25. 25. 25 25 25 
А) 3 6 6 10 16 28 36 76 96 126 176 276- 


In the cases n = 19 and n = 20, it is unknown whether the bound v(A4) can be 
achieved. In all other cases an extremal set of equiangular lines has been realized 
by a regular two-graph, cf. refs 15 and 16. Sometimes, these sets have interesting 
automorphism groups: for n = 21 the unitary group PI'U(3, 52), for n = 22 
the Higman-Sims group, for п = 23 Conway's group :3. 
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Example 5.8. In the case C", A = {a}, we realize 
n=2m, «`! =4m—1, v(A) = 4m, 

for many values of m, as follows, cf. ref. 17. Let C be a skew conference matrix 
of order 4m, that is, a skew matrix with elements 0 on the diagonal and + 1 
elsewhere satisfying ССТ = (4m — 1) Г. Such matrices coexist with skew 
Hadamard matrices of order 4m. The complex matrix 7 + i (4m — 1)? С 
is Hermitean positive semi-definite of rank 2m. Hence, C?" contains 4m unit 
vectors with Hermitean inner products + i/(4m — 1)!/?. 


Example 5.9. In the case C", A = (0, «}, the following examples have been 
realized: 


(n, «t, v(A)) = (5, 4, 45), (9, 9, 90), (28, 16, 4060). 


The first and second example may be obtained from the regular two-graph on 
276 vertices, and will be treated elsewhere. For the third example, whose 
automorphism group is Rudvalis' simple group, we refer to ref. 5. 


6. Absolute bounds for A-sets 


In this section we obtain upper bounds for the number of vectors of an 
A-set X C Q,, depending only on the cardinality s and the type ғ of A, and 
not on the elements «,, %2, ..., % of A. Moreover, if the bound is achieved, 
then the elements of A turn out to be determined by n, s, €. 


From the characteristic matrices Hp, of a finite set Y C (2, we construct 
the matrix 


H, m. [Но,в› Hi,» EE Н,_.,]. 
Since Hp, has Q,,.(1) columns, Н, has 


5—8 


Ms, £) = у Q,,,e(1) 


columns. By use of the explicit expressions for Q,,-(1) mentioned in sec. 2, 
we deduce 


2s—e—1 n — 1 5— 8—1 
mo, - (^7 { ) к; -( ux C : уве" 
п — 1 п— 1 п — 1 


Theorem 6.1. For any A-set X С Q, the inequality 
|x| < M(s, 2), with s=|A|, e= type (4), 


holds. In the case of equality the annihilator of А is the poly- 
nomial i 


157* 
a У: О, (х). 


V k=0 
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Proof. Let в = type (A), let F(x) be the annihilator of A, and let 
F(x) = 3 fece О, (х) 
к=0 


be its expansion in the Jacobi polynomials О, „(х). Define the diagonal matrix 
A, of size M(s, £) by 


A, = 5, To ФЛ, ©... © f ue Is 


where J, denotes the unit matrix of size О, (1). By use of lemma 4.2 and the 
expansion of F(x) we observe that 


H, A, H, = KE, n) Fé 921 = 1. 
This proves the inequality for v — |X |, since it implies 
min (v, M(s, є)} > rank H, > rank (H, 4, Ñ.) = rank I = v. 
Next, we assume the bound to be tight, that is, 
v = M(s, £). 


Then Н, is nonsingular and A, is positive definite. Hence all diagonal entries 
fx, Of A, are positive. Since F(x) is (i, £)-compatible with X, application of 
theorem 4.4 yields 


O<fi.e<1/v, for і= 0, 1,...,5— е. 


In addition, from 
v=v0v Е(1) =v > Sine Qi,e(1) < > Q,,-(1) = Ms, Э! 
i20 1=0 


it follows that 
Sine = 1/2, for i—0,1,...,5—&, 
which implies the assertion. 


Remark 6.2. 'Theorem 6.1 implies that, if the bound is tight, the elements of 
A are the zeros of the polynomial 


Y. Qn). 


From the defining equations for О, „(п;х) := Q,,-(x) it can be proved, in 
view of remark 2.2, that 


r 


n 
i Qi (1x) = ба OED Qr, (n + у; х), 
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with y = 2 for R” ,and у = 1 for C". This again stresses the importance of 
the Jacobi polynomials for our theory. 


Table II contains the explicit bounds, and the accompanying annihilators, 
suitably normalized, for R” and for C", in the cases s = 1, s = 2, and s = 3, 


e= 1. 


: 


3 
= 
B+ we + © 4 


ee len 


vu + кю + 


TABLE II 
Absolute bounds 


annihilator 
n 1 
+ 
2)x—1 
^ ) (n+2)x 
п? (n+1)x—1 
2 
) (n+ 4) х — 3 
(п + 2) х— 2 


Ww 


(п 4- 4) (п + 6x? —6(n-4- 4) x -3 


mT 


(n+ 2) (п 4-3) х2 —4(п + 2) х + 2 


aN 


М МЯ е МУ 


(n + 6) (n + 8) x? — 10 (n + 6) x + 15 


(n + 3) (п + )x*—6(n-4-3)x--6 


Example 6.3. For К", the following realizations are known, сЁ. refs 4 and 15: 


n=71, M=2 , A= {i} , Aut = Sp(6, 2); 
n=23, M—216 , A= (hl , Aut = Соп :3; 
n=8, М = 120 , А= {0,}} , Аш= W(E,); 
п = 23, М = 2300, А = (01) , Аш = Сор 2; 


М = 98280, A= {0, +, 2), Аш = Сор -1; 
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Example 6.4. For C", the following realizations are known: 
n=2, M=4, A= 2} ; n=3, M=9 , A= {4} ; 
n=4, M=40, A= {0,4}; n=6, М = 126, A= (0,1). 


Details about these line systems, for which we refer to Coxeter 9) and 
Mitchell 14), will appear elsewhere. 


7. Properties of extremal A-sets 


In this section we exhibit some algebraic and combinatorial properties of 
A-sets achieving the bounds of theorems 5.2 and 6.1. For similar results in the 
theory of t-designs, we refer to Cameron 3) and Delsarte $). 

For A = (a, ..., as}, let X C Q, be an A-set of cardinality v. For any 
labelling of X, for any єє {0, 1} and je (1, 2,..., s), the square matrix D,, 
of order v is defined by its elements 


D, AE, 1) :— (5, my if |66, m) =o; = 0 otherwise, 


where ё and y run through X. Next, we define the linear spaces Ay and A, over 
the field C as follows: 


A, = (1, Di, Dao tty Dao 
Since D, = 0 for a, = 0, we have 
dim A, = s--1—ó2, where д = type (A). 


We are interested in conditions for A, to be an algebra, that is, to be closed 
under matrix multiplication. Among the reasons for this interest are the com- 
binatorial properties of such algebras. Rather than going into details, we 
mention the following examples. 


Example 7.1. For an A-set X two distinct elements £,2 € X are called ith 
associates whenever [Ca DE — a, holds. This definition yields an association 
scheme if and only if A, is an algebra, cf. ref. 2. 


Example 7.2. In the real case R”, an A-set X with s = 1, type (А) = 0, for 
which А, is an algebra, corresponds to a regular two-graph, cf. refs 15 and 16. 

For ee (0, 1} and X C Q,, let Ho,,, H,,,, H;,,, .. . denote the characteristic 
matrices defined in 4.1. We construct the following matrices of order v — lx |, 
for i = 0, 1, 2,...: | 

Jie = v^! Hi. f... 
Lemma 7.3. If i Н, о = 0, fork = 1, 2, .. 24 + в, then Jo, Ji, + © +> Jae 
are idempotent and pairwise orthogonal. 
Proof. By application of lemma 4.5 we have 
Hy LH, = Aij for Ozxi--jx2d, 


which proves the assertion. 
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Theorem 7.4. If for an A-set X C Q,, with s = |4, 6 = type (A), we have 
i Ayo =0, for 1<k<2(—1l—ed)+¢, 


then A, is a commutative algebra. 
Proof. Let X be an A-set with A = {01, «5, ..., Œs}. By the first formula of 
lemma 4.2 the matrices Ј,,, satisfy 


v J,. = Qi, OH EV Y Qi, (aj) Ру, 
Ј=1 
hence belong to A,, for all integers і > 0. Put 
d:=s—1—e6=dimA, — 2. 
If we can prove that the d+ 2 matrices 
1, Јо,» Л HAE Да, 


form a basis for A,, then the assertion follows by application of lemma 7.3. 
So suppose that these matrices are linearly dependent with coefficients 
Coos Cos Cis ++ - Ca, With at least one 0 5 се (co, с, ..., Ca}. Then, by sub- 
stitution, we have 


(co v + х Ci Qi, о) 1+ у 10: Y Ci 9) р), г = 0. 


J=1\i=0 


But J, D,,, Р, ..., Ds, are linearly independent, except for D,,, = 0 in 
the case a, = 0, є ô = 1. Hence the nonzero polynomial 


Co Qo, eX) + с, О.) +... + Ca Qu), 


of degree < d, vanishes for all о; e A, except possibly for «, = 0 and e ô = 1. 
This is impossible by d = s — 1 — є б. Now the theorem is proved. 


Theorem 7.5. Let the annihilator F(x) of A, with s = |A|, 6 = type (A), satis- 
fy fi; >O for i= 0, 1, ..., s— д. Let 
fos =f =... =» for t:= max {0, s +e—2(1 + eô)} 
If XY C Q, is an A-set achieving the special bound 
vfo, = F(1) =1, 


then A, is a commutative algebra. 
Proof. Let the polynomial x't® F(x) have the Jacobi expansion 


stt 


x+? F(x) = у с, Ок,о(х). 


k=0 
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The coefficients c, may be expressed in terms of the given /; з, and it readily 
follows from the recurrences of sec. 2 that 


Со = fo,» €& 70, for kK—0,1,...,5-F t 


Since x'*^ F(x) is (0, 0)-compatible with X, theorem 4.4 may be applied. To- 
gether with c, > 0 the condition F(1) — v со = 0 yields 


ü Ayo = 0, for ]zksx;s-t. 
By theorem 7.4 this proves the theorem, since 
stt>2(s—l1—e6)+6. 


Theorem 7.6. If an A-set X achieves the absolute bound v = M(s, ô), then Ao 
and A, are commutative algebras. 

Proof. According to theorem 6.1, the hypothesis implies / 5 = fi =... 

= f,_s,5 = 1/v. Hence the desired result directly follows from theorem 7.5. 


Remark 7.7. Theorems 7.5 and 7.6 yield information about the extremal A- 
sets, and provide necessary conditions for their existence. 
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Abstract 


The invariance of the difference between the instantaneous magnetic and 
electric energies in equivalent passive reciprocal networks leads to a 
characterization of any real transient (even secular) as dominantly 
magnetic or electric. Algebraic expressions for the associated invariants 
are deduced both from the state-space and the scattering descrip- 
tions of the network. This yields bounds on the number of independent 
resistances in any realization. By contrast, a network having only com- 
plex eigenmodes is continuously transformable into its dual and always 
admits an antimetric equivalent. 


1. Introduction 


The equivalence problem for closed passive reciprocal networks has been 
treated in a previous paper !) and various canonic forms were obtained. Two 
equivalent networks have the same determinant (the same eigenvalues) and the 
same elementary divisors (imposing identical structures to eigenspaces associated 
with multiple eigenvalues). Moreover, reciprocity preservation forbids changing 
RL-circuits into RC-circuits, so that simple real eigenvalues are distinctly 
characterized as magnetic or electric. 

Inthis paper we establish a similar distinction for real secular modes, although 
these necessarily involve all three kinds of elements (R, L, C). More precisely, 
for a fully defective real mode of order s, where every state is of the form 
f (t) exp (о t), with f a polynomial of degree < s— 1, the Lagrangian (dif- 
ference between the instantaneous magnetic and electric energies) is of the form 


L = T, — T, = g(t) exp (—2 ao t) 


where g is a polynomial of degree < s— 1 whose leading coefficient has a 
constant sign, for all initial conditions exciting that secular mode alone, so 
that the asymptotic decay of L is either dominantly magnetic. or dominantly 
electric. Every real mode (secular or not) is thus characterized by an invariant 
reactivity index (v = --1for a magnetic, т — —1 for an electric mode) which 
will be computed algebraically from the network description. 

In secs 2 to 6 the state-space description is used. The relation between rec- 
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iprocity preservation and Lagrangian invariance is discussed in sec. 2, supple- 
menting our first treatment !). This invariance essentially prevents a network 
having real eigenvalues to be equivalent to its dual; by contrast, dual equiv- 
alence is allowed for networks with only complex eigenvalues, as proved in sec. 3. 
Real secular modes are analyzed in sec. 4, and their energy behaviour in sec. 6. 
The reactivity indices are computed in sec. 5 from the state-space description 
and in sec. 7 from the scattering description. In sec. 8 a lower bound is deduced 
for the number of resistances in any realization of a prescribed transient 
behaviour. 


2. Lagrangian invariance 


In state-space synthesis, a minimal -realization of an n-port of degree m is 
obtained as a frequency-independent (n + m)-port closed on m unit reactances. 
If H is the hybrid matrix of the (n + m)-port, all equivalent minimal realiza- 
tions are deduced by the similarity transformation replacing H by 


F=T HT (1) 


where T is an arbitrary non-singular matrix leaving invariant the n free ports. 
If, in addition, the initial realization is reciprocal, the matrix Н is 0-sym- 
metric, i.e. satisfies 


0H—H'0 (2) 
where 0 is diagonal of + 1 entries. Reciprocity is preserved in (1) if one also 
has 

0, F—F'0, (3) 


for some matrix 0, of the same type as 0. From (1) to (3) one deduces 


HU=UH (4) 
where 
U-—TO0,T' 0. (5) 


Youla and Tissi ?) have proved algebraically that for minimal realizations the 
only solution С of (4) is the unit matrix. This produces 


06-28 ` (6) 
and 
ТӨТ'=Т'ӨТ = 6. (7) 


If x denotes the vector of independent variables at the ports of the (n F m)- 
port, (7) expresses the invariance of the quadratic form 


І=х' 0х (8) 


іп the transformation from x to T x. In (8), the terms corresponding to the 
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n free ports are invariant anyway in case of identical excitation, and the remain- 
ing terms represent 2 (Т„— 7,). The invariance of Т, — T, has been proved 
previously by Tellegen ?) and implies, in particular, the invariance of the 
reactive power in steady state. 

For two non-minimal realizations of an n-port, even (1) does not hold in 
general. In the case of reciprocal n-ports, however, the additional freedom 
produced by non-minimality corresponds to internal states that are both inob- 
servable and uncontrollable from the ports, so that the invariance of Т„— Te 
still holds for zero-state equivalence, as mentioned by Tellegen on one example. 

In this paper we only deal with the equivalence problem for closed reciprocal 
networks (without free ports). If the network is the m-port of hybrid matrix Н 
closed on т reactances, the transients are ruled by the equation 


dx 
Hx + — = 0. (9) 
dt 


If x is changed into T x in (9), premultiplication by T^! of the result yields 
Ех -+ dx/dt = 0 where F is (1), so that all equivalent networks are obtain- 
able by similarity. On the other hand, reciprocity preservation leads to (4) and 
(5), but it is no longer true that the only solution of (5) is U = 1,„. For instance, 
for Т = lm, any diagonal matrix F = H satisfies (2) and (3) with 0 and б, 
separately arbitrary, so that (5) reduces to 0, 6, an arbitrary matrix of + entries. 
Consequently the invariance of (8) for equivalent closed networks, where 
equivalence is defined as the preservation of the form (9) of the state equation, 
is not an algebraic consequence of reciprocity conservation. Indeed, an RL- 
network and an RC-network of identical degrees obey the same equation (9) 
but with different physical meanings of H and x, whereas (8) is positive in the 
first case and negative in the second. In conclusion, one must either accept that 
an RL-circuit is equivalent to an RC-network, which seems violently un- 
physical, or strengthen the very concept of equivalence with respect to the 
mere preservation of the form of (9). 

At this stage it is important to analyze the origin of the difference between 
the case of equivalent minimal n-ports, where (7) is a consequence of reciprocity, 
and the case of closed networks, where it is not. For n-ports, the possibility of 
applying forced states x exp (p t) for all p allows one to explore the internal 
behaviour from the ports, whereas only free states at discrete frequencies exist 
in closed networks and there is no possibility of analytical continuation. Physical 
intuition thus suggests the possibility of proving (7), also for closed networks, 
provided some form of continuous perturbation is allowed. In other words, the 
concept of a frozen network without interaction with its environment is an 
excessive idealization in which some essential physical properties аге lost. 

Continuous perturbations can be introduced in a number of ways, and the 
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choice is a matter of taste. One may couple very weakly all reactances of a closed 
network to some common port and apply continuous excitations from the port. 
Alternatively one may allow tolerances on the components and require reciproc- 
ity to be preserved for all reciprocal perturbations. The second approach is 
followed in the appendix, where (7) is thus established for closed networks. 


3. Networks without real modes 


If a matrix H has no real eigenvalues, its characteristic polynomial 
det (H — 4 1m) 


is a product of quadratic factors that are sums of two squares of polynomials, 
and it is itself of the form f?(4) + g?(A). Since Н satisfies its own characteristic 
equation, one has f?(H) + g?(H) = 0 for some polynomials f and g in H, 
with scalar coefficients. All powers of H are 6-orthogonal with H and com- 
mute, so that the rational function r = fjg is defined and the matrix О = кН) 
is also 6-orthogonal and commutes with Н. One thus has 
Q'——l,; 090Q—Q'065 QO HQ=H. (10) 
Since the order of a matrix without real eigenvalues is even, we write m = 2n. 
Moreover *), one has tr 0 = 0. We thus define the matrices 


1, 0 0 exl. 
js | | ; Y= [ | (11) 
0. 1 1, 0 


y ү = In y? = —1м; yw 0 y = — 6. (12) 


and one has 


The partitions on the hybrid matrices induced by (1 1) are 


RON G м 
н=| } vanv-l | (13) 
-N G —N R 


and the second network is the twisted dual of the first one, since the transfor- 
mation of matrix ¥ also interchanges inductances and capacitances owing to 
the last equation (12). Since Y is thus not 6-orthogonal, (8) is not invariant in 
the transformation (13). We now prove that, if H has no real eigenvalues, there 
exists a -orthogonal similarity transformation relating both matrices of (13). 
In fact, (7) and 


T" HT-—-V-HWV 


become identities for Т = О У, by (10) and (12). 
A network satisfying 


V-iHW.-H, | (14) 
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ie. G = R, N = N', is antimetric. Having proved that a network without real 
eigenvalues is reciprocally equivalent to its own twisted dual, i.e. that it is 
potentially antimetric, we now show that it can be made actually antimetric, 
i.e. that there exists a 0-orthogonal transformation T from Н into (1), such that 
(14) holds for F. One must thus find a matrix T such that 


VT OHTWV-TOHT (15) 
and (7) hold. From (10) one deduces 


so that Q has n eigenvalues +j and п eigenvalues —j. Moreover, the nullity of 

-each factor of (16) is at least n, so that О has exactly n eigenvectors for each 
eigenvalue and is non-defective. Аз a consequence 5), Q is 6-orthogonally 
similar to У, so that 


Q—TVT-! (17) 
holds for some T satisfying (7). This allows to replace T Y by Q T in (15), 
which is thus established. In fact, antimetric canonic forms have been obtained 
in a previous paper °). 
4. Jordan canonic forms 

The Jordan canonic form 

J=T HT (18) 

ofa general 6-orthogonal matrix is a direct sum of blocks Jy + А, 1,, one 


block per independent eigenvector of H. Since J is not 0-symmetric, T is not 
6-orthogonal, but (2) imposes 


Р VJ—J'V (19) 
where 
V=T' 0T (20) 


is non-singular symmetric. On the other hand, T is generally not unique since 
any transformation matrix A leaving J invariant, i.e. satisfying 


ЈА= АЈ (21) 


сап be incorporated into Т, which becomes ТЈ, and this changes (20) into 
A' V A. With E, denoting a matrix of order k of unit entries on the second 
diagonal, and calling E the direct sum of blocks E, conformal to the partition 
of J, one has 


EJ=J'E (22) 
so that, if A satisfies (21), 
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V=EA ; (23) 
satisfies (19). 
The general solution of (21) is well known from the theory of commuting 
matrices 7). If each Jordan block corresponds to a distinct eigenvalue, A is a 
direct sum of blocks 4,, and each block is of the form 


а а а, 
0 & a 
0 1 (2 4) 
0 0 ag 
Correspondingly, (23) separates into symmetric blocks of the form 
0 0 a 
0 ag a 
о 1 (2 5) 
в а а, 


If two blocks J; and J, (possibly of different orders) have the same eigenvalue, 
the corresponding superblock of A (corresponding to the direct sum of J, and J;) 


is of the form 
Ay A 
| “ Ik | (26) 
Аа Akk 


where each submatrix is of the form (24), the off-diagonal submatrices being 
completed by additional rows and columns of zeros when they are rectangular. 
When (26) is used to generate the corresponding superblock of V, by (23), the 
arbitrary off-diagonal parameters of (26) are further restricted by the sym- 
metry of V. The resulting form of the superblocks of V is illustrated by the 
example ; 


o 

y, У, Go Wi bo 

V, V, | Caec a bo. Pr 07) 
bo : Co 


bo by {со су 


which occurs if a 5-tuple eigenvalue has a Jordan structure of orders 3 + 2. 
If V is (27), the off-diagonal matrices corresponding to V, are reduced to - 
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zeros in the transform А” V A, where A is (26) with unit diagonal submatrices, 
with A,, = 0 and with A,, determined by 


Va Em + Y, = 0. (28) 


This diagonalization process for submatrices is exactly isomorphic to the clas- 
sical Gauss algorithm reducing scalar off-diagonal entries to zeros in a sym- 
metric matrix. The pivot submatrix, V, in (27), must be non-singular, and this 
is automatically true if V, is the submatrix of largest order: since (27) is non- 
singular, one has a) 5 0. In the case of a partition into submatrices of iden- 
tical orders, both diagonal submatrices of V can be singular, but the off-diagonal 
submatrix is then non-singular; as in the Gauss algorithm one then generates 
non-singular diagonal submatrices by an orthogonal transformation of matrix 


EJ к | | (29) 
y2L 1, –1, 

One thus obtains for V a direct sum of blocks (25), to be called V,, of the same 
orders as the Jordan blocks Лк, in all cases. 

The blocks V, corresponding to real eigenvalues are real. The corresponding 
submatrix 4, — E, V, of the form (24) has a k-tuple eigenvalue ag. For 
ао > 0, A, has thus a unique real square root of the same form, and the 
transform (4',)* !? V, А 1/2 is (47) 1? E, Agt’? which is E, because every 
matrix M of the form (24) satisfies M' E, — E, M. Consequently one can 
reduce V, to v, Ep, where v, is the sign of the entry ag in Vp. For blocks 
associated to complex eigenvalues 4,7 !/? can be complex, and one can always 
achieve V, — E,. | 

In conclusion, we have proved that a 60-symmetric matrix is similar to its 
Jordan form by a transformation T' satisfying (20), where V is a direct sum of 
blocks V, = E, for complex eigenvalues and of blocks V; = v, E, with 
тк = + 1 for real eigenvalues*). 


5. Invariance of the reactivity indices 


Wenowprove that the numbers T, associated to real eigenvalues are invariant, 
ie. that the same set is deduced from any diagonalization process of V into 
separate blocks V,. From (18) and (20) one deduces the identity 


VU --pl)'-T8(H-ctply'T. (30) 


If о; is a real eigenvalue ‘of multiplicity s of H associated to a single eigen- 


vector, the principal value of (Н + p 1,)^* near p = — a, is determined by 
the partial fraction expansion 


*) This lemma was used in the proof of theorem 5 of ref. 1 but the derogatory case was not 
analyzed in detail. In the meantime the same lemma appeared in F. Uhlig, Lin. Alg. 
Appl. 8, 351-354, 1974. 
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К, К, K; 

M ME с АА 

poto (pda) (p + о) 


where all matrices К, of order m are 0-symmetric. Correspondingly, the prin- 
cipal value of (J + p 1,,)~1 results from the expansion 


M КЕ 


i J, Ј2 a gi 

[+ + рар 
pe (pco) (p+a,)3 (p + œ) 

(32) 


of a single Jordan block. Since (32) is premultiplied in (30) by V, = v Е,, one 
has by comparison with Т” 0 (31) 6, 


(—I*TEJ^—T'OK,,.T (A =0,1,...,s— 1). (33) 
In particular, for k = s — 1, (33) has rank one, and 
T = (—1y7! sgn tr (0 K)). (34) 


Since the sequence of matrices Е, J," of (33) written in reverse order 
(k—5—1,5—2,...) is 


1 1 (35) 
1 |, 1 hli 


the successive ranks (signatures) are 1, 2, 3, . . . (1, 0, 1, . . .) and the remaining 
relations (33) fix the ranks and signatures of all symmetric matrices 0 K, for 
kxs—2. 

The case where the eigenvalue o, is associated to several eigenvectors will be 
discussed on the basis of the example of two eigenvectors corresponding to two 
Jordan blocks of orders s and t. Assuming 5 > t, the expansion of (H + p 1,)-! 
replacing (31) thus stops with K,, whereas (32) must be replaced by the direct 
sum of two expansions, so that (33) is replaced by 


(—1)* Ҝ LN. ] TOKT (Е=0,1 1. (36) 
— = = U, 1,...,5— 1). 
0 mE SE od | 


If s = t, (36) has rank two for k = s— 1 and 0 К, is congruent to 


EM 0 
T9 E 2 


Consequently т, and т, are determined by the signature of 0 K, within an 
irrelevant permutation. If s >t, one has J?^* = 0 and т, is determined by 
the signature of 0 K, in (36) written for k = s— 1. If k is then decreased, one 
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still has Л,“ = 0 in (36) as long as k > t — 1, so that the ranks and signatures 
vary as in (35). For k — t — 1, 7; appears for the first time in (36) so that the 
rank suddenly increases by 2 for k changing from /— 2 to Г — 1, and the 
direction of the deviation in the signature evolution from the pattern 1, 0, 1, 
0, . . . determines т,. It is clear from this example that in all cases the reactivity 
indices v, of all Jordans blocks associated to a multiple real eigenvalue are de- 
duced uniquely from the ranks and signatures of the matrices 0 K, arising in 
the partial fraction expansion of the resolvent matrix 0 (H + p 1,)-!. 
6. The energy of a secular state 
With 
x=Ty (37) 


and (18), the equation (9) reduces to 
dy 
Jy+—=0 (38) 
dt 


and splits into separate equations in accordance with the partition of J. In the 
following we thus only treat an isolated system corresponding to some real 
eigenvalue o, of order s associated to a single eigenvector, and thus discuss 
the equation 


dy 
(Js + 0,1)y t = 0 (39) 
dt 
whose solution is 
у = exp (—a, f) exp (Js t)a (40) 
where a is an arbitrary s-vector. In (40) one has 


5—1 


(s— 1)! 


exp(J;f) = 1, d-tJ; +... + Jot (41) 


since higher powers of J, vanish. 
By (37) and (20), the Lagrangian (8) is L = у V y and reduces to ту' E, y 
for the mode (40). By (40) and (22), one has 


L = t exp (—2 o, t) a’ exp (J,' t) E, exp (J, t) a 
= texp(—2 a, t) а E; exp (2 J; t) а 


5—1 
(21) 
= техр(—2 a, t) vs ce (42) 
k=0 | 


k 
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where €, = а E, Jj a. 
The vector a can be noted i 
asi 
Ta 
а=| : |=(@æ@ls+a J+... +a- ITY): |=Ан 


ау 


о 


do 
where 4 is (24) of order s. Since (41) is also the matrix 


г 12]2 


of order 5, опе has 
y = exp (—a, t) P Au. 


Since P A u is the last column of P A one finally has 
{57-1 15-2 
ao ———— + a, ———- +... 
CDU GS 
ts-2 15-3 


Уеа cy ш (s— 3)! v 


At+a, 


do 
On the other hand (43) is 


Cy =u А EJ Аи = и Е, А? Ји = и E,J Bu 
where 
В = А? 
is а matrix of type (24) with 
b, = 4’, 
b,—2aga, 
b, =2 a a, + а,?, 
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(43) 


(44) 


(45) 


(46) 


(47) 


Finally J,* В is a matrix of type (24) where the subscripts of the entries b; are 
decreased by k, the negative subscripts being replaced by zeros. In (46), the 
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matrix E, J," B is then of the type (25) and the operation v'. .. v selects its 
lowest rightmost entry b, ,.,. Consequently one has c, = b,_,-, and (42) 
becomes 
5—1 5—2 
L = техр(—2 а, /) (nA jug 
(s— 1)! (s— 2)! 
For a #0, the dominant term of (45) contains /°-! and so does the dom- 
inant term of (48) whose sign is v because bg is a square. For ag = 0, the 
dominant term of (45) is 1—2; but one then has bo = b, = 0 by (47) and the 
dominant term in (48) is £57? and has again the sign of т because its coeffi- 
cient b, then reduces to a,? in (47). By induction, whenever the dominant term 
in the state is £57 (i = 1,2, ..., s), the dominant term in L is t57?!** and has 
the sign of т for i < (s + 1)/2, but one has L = 0 for i (5 + 1)/2. Con- 
sequently for any initial conditions exciting only the mode corresponding to 
the Jordan block J, associated to a real eigenvalue, either L is identically zero, 
or L decays asymptotically to zero through positive (negative) values for 
t = +1 (= —1), so that the mode can be characterized as dominantly 
magnetic (electric). 


+++). (48) 


7. Scattering description 


We now describe the closed network as a lossless r-port terminated on some 
number r of unit resistances and characterize the r-port by its symmetric para- 
unitary scattering matrix S(p), of degree т. If a (b) denotes the incident 
(reflected) wave vector, the network equation b = Sa for a forced state 
exp (p t) can also be written a = 57! b or 


а = S(—p)b (49) 
and reduces to S(—p,) b = 0 for а free state exp (p, t), hence in particular to 
Sla) b = 0 (50) 


for а real mode exp (— o, t). Moreover, since the scattering description and 
the state-space description must yield the same eigenvalues and the same num- 
bers of independent eigenvectors, the matrices H + p; Im and S(—p;) must 
have identical nullities. Since the nullity of S is at most its dimension r (the 
number of resistances), one has the restriction 


r > max, null (H + p, lm). (51) 
Consider again a forced state exp (p t) with p=«a-+ jw. The reactive 
power 20 (Т, — Te) = Imi v entering the r-port expressed in the vector b is 


p EE Sb 


m^ ie 


8 jo 62 
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For a real state (w = 0, b real), (52) reduces by l'Hópital's rule to 


dS(—o) 


a 


4(T,— T) =—b' b. 


For a simple real transient exp (— о; t), the instantaneous Lagrangian is then 


dS(a;) 
4L — Б b exp (—2 a t) (53) 
0 
and its reactivity index is 
ds 
т; = sgn b’ — b (54) 


do, 


where b is the only solution of (50). 

For a network containing only one resistance, S reduces to a scalar all-pass 
function s(p), and (54) is the sign of ds/da, at the positive real zero о; of s(p). 
If sgn s(«) is plotted versus о, one thus has v > O (т < 0) at a simple zero 
where the plot looks like fig. 14 (fig. LB). If multiple real zeros occur, (51) 
with r — 1 forces each eigenvalue to be fully defective, so that multiple zeros 
can be treated as arising from the confluence of simple zeros. The asymptotic 
behaviour of the instantaneous Lagrangian corresponding to several distinct 
simple modes is determined by the slowest decaying mode, i.e. by the smallest o, 
and this remains true by continuity after confluence. Consequently for a zero 
of odd order resulting from the confluence of a pattern ABA...(BAB...), 
т is postitive (negative) so that the rules of fig. 14 (fig. 1B) hold after confluence 
for odd orders. For a zero of even order originating from AB. . (ВА...) the 
signature plot after confluence is fig. ІС (fig. 1D) and т is then positive (nega- 
tive). If the distinct eigenvalues are ordered such that 


0,2905... > 4, (55) 


and if the multiplicity of œ, is called 7, it results from the plot of sgn s(«) 
inspected in the reverse direction, i.e. starting from 


A B C D 


Fig. 1. (A) zero of odd order, т > 0; (B) zero of odd order т < 0; (C) zero of even order 
т> 0; (D) zero of even order t < 0. 
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sm) au euet (56) 


that one has by recurrence 
T, = (—1)17!0,; Ti = (—1)" v, (Gi = 2, 3,..., 4) (57) 
and this is equivalent to 


dis 
т, = (—1)"-1 sgn Y 


: (58) 
a," 
We now return to the general matrix case of (50) and (54) and assume that 
S(a,) has some nullity л. Since S(a,) is symmetric it can be diagonalized by a 
constant orthogonal transformation to be disregarded since it leaves invariant 
the r resistances. After the transformation one has 
0 0” 
$(о{) = | | (59) 
0 X j(r-n) 
and the solutions of (50) are the unit vectors along the first л coordinates, so 
that only the principal submatrix [dS/do,;], of order n of dS/do, is relevant 
in (53). Since (59) is still invariant by an arbitrary orthogonal transformation 
on the first n coordinates, one can also diagonalize [dS/do,], which becomes 
diag (0,, 6,,..., 6,}. If all 0, 40, the n solutions of (50) inserted in (54) 
produce п non-zero indices 


т, = sgn б (60) 


thus characterizing п distinct non-secular modes at «,. If the first k, say, д, 
are zero, only п — К non-zero indices are produced in (60) and the first k 
vectors correspond to secular modes. One can then diagonalize the submatrix 
[d?S/da,?], by an additional orthogonal transformation which does not alter 
the previous submatrices [S(«,)], and [dS/da;], since these are zero. Let thus 
[d?S/da,?], = diag (ei, £2, ..., Ex}. If all e, zz 0, one has separated the prob- 
lem into k distinct scalar secular problems of order 2, and the first k still missing 
indices of (60) are given, in accordance with (58) with n, — 2, by 


T, == —Sgn 8. (61) 


If some e, are still zero, one continues the process by considering successive 
derivatives of higher orders, until a complete set of п non-zero indices т; is 
obtained. 


8. Bounds on the number of resistances 


For a prescribed transient behaviour to be realizable by a network containing 
only one resistance, it is necessary by (51) that all multiple eigenvalues (if any) 
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possess only one eigenvector. In addition, adjacent pairs of reactivity indices 
must comply with the second relation (57). The first relation (57) can always 
be satisfied, for it merely determines the sign of (56), i.e. the ambiguous sign 
in the scalar reflectance 


s(p) = + R (62) 
g(p) 


where g(p) is the specified network determinant. The conditions are thus also 
sufficient and the state-space realization of (62) is known explicitly 5). The 
resulting hybrid matrix is in fact a permutation and renormalization of a tri- 
diagonal matrix discovered by Biickner °) and interpreted in network terms 
by Bashkow and Desoer 1°), Biickner’s matrix has been rediscovered by Puri 
and Weygandt !!). The recent discussion by Chen +?) is erroneous in the de- 
rogatory case. А slight variant is the Schwarz form discussed by Barnett and 
Storey 13). | l 

When the conditions of the previous paragraph are not met, more than one 
resistance is required and we wish to find the minimum. We first consider only 
the set of real modes (55) and call aj, bi, c;, d; the number of modes at a; 
having the behaviour of fig. 1, A, B, C, D, respectively. The problem is to 
construct a scattering matrix of minimum dimension accepting the prescribed 
pattern for its real eigenvalues and is solved by following the signature of the 
matrix on the real p-axis, thus generalizing the similar procedure used in the 
Oono-Yasuura synthesis +4). One must link together the prescribed number of 
figures A B C D at successive (not necessarily adjacent) «, in the natural order 
so that the input and output + signs are matched at every junction, while 
keeping minimum the total number of chains generated in the linking process. 
Let some number of chains be already formed which have exhausted all modes 
from a, to о; ; and let р, ; (2:41) designate the number of positive (negative) 
signs appearing at the outputs of the p,,, + ту chains. In crossing a, a, ++ d, 
positive outputs sign are generated and at most b; + d; among the p,,, past 
positive output signs can be consumed, if p,,., > b, + dj, so that the number 
of inherited positive signs is at least f (р; — b; — dj), where 


fG) » fo x>0 
x = 
0 for x <0. 
Finally, one has 
Pi >а + 4 f(pisi— b — d) (63) 


and a similar reasoning on the negative signs yields 


n > b, + e +f (nia — @— c). (64) 
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The number of chains produced at the end is p, + nı. Since the right-hand 
sides of (63) and (64) are monotone non-decreasing functions of р, +; (Ж +1), 
all inequalities operate in the same direction, so that any strategy producing 
equalities in (63) and (64) at every step is optimal. The simplest strategy then 
starts from a, + b, + с, + d, chains after а, and proceeds by consuming in 
an arbitrary way the largest possible number of past positive and negative signs 
at every new о; and by initiating new chains only when strictly necessary. One 
thus constructs a diagonal scattering matrix of order r = p, + nı. Complex 
pairs of conjugate eigenvalues can then be distributed arbitrarily among the 
existing diagonal entries if (51) is satisfied, else the size of the diagonal scattering 
matrix must be further increased and is then determined by (51). 


Appendix 

We consider perturbations of the equivalent networks of hybrid matrices H 
and F, whereas the reactances which are normalized to unit values remain 
unperturbed. We thus change H into Н +- ôH and (1) into F + ôF where 
ôF = T-t ôH T. By requiring F+ ôF to remain reciprocal for arbitrary 
reciprocal perturbations of H one establishes 0H U = U 6H, where U is (5), 
for all 6H such that 0 dH = 6H’ 0. By taking ôH diagonal one proves that U 
is diagonal, and by taking 6H proportional to various permutation matrices 
one establishes the equality of all diagonal entries of U, so that one has 
U —a1,. We now set a = + b? and 


T = Tib (A.1) 
so that (5) becomes 
T, 0, T,' = + 0. (A.2) 


The transformation (A.1) from H to F thus appears as the product of T, by 
the scalar transformation 1/b which cancels in (1) but changes x into x/b, so 
that it must be interpreted as a change of the unit (watt)!/? of measurement. 
By requiring both equivalent networks to be described in terms of the same 
unit, one forces b = 1, hence T = Т,. On the other hand, by Sylvester's law 
of inertia, 0, in (A.2) is a permutation of + 0, say + P 0 P', so that (A.2) 
reduces to 


ТРӨР'Т'= 0. (Аз) 


If one now requires the transformation from H to F to depend continuously 
on some parameters, so that T changes gradually from 1,, to T, (A.3) must also 
hold for T = 1, so that one has P 0 P’ = 0, and (A.3) reduces to (7). We 
have thus proved that (6) and (7) hold for continuously equivalent closed 
reciprocal networks, and the invariance of (8) for corresponding initial con- 
ditions (x changed into T x) results. 
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Abstract 


A short review is given of the balance and constitutive equations in the 
theory of the interactions of electro-magnetic and elastic fields. Part of 
the constitutive equations is derived from thermodynamic potentials. In 
the classical theory these potentials are functions of the instantaneous 
values of the independent parameters. The theory is extended to the 
class of materials for which the thermodynamic potentials are func- 
tionals, defined in the domain of the histories of the parameters. This 
class of materials exhibits relaxation and retardation phenomena. After 
a general formulation for elastic materials with electro-magnetic relaxa- 
tion, the theory is confined to linear relaxation, in which the functionals 
arequadratic. Inthe last section the special case of piezo-electric retarda- 
tion is discussed. 


1. Introduction 


This paper may be considered as a generalization and continuation of a recent 
paper !) by the present author concerning the general theory of electro- and 
magneto-elastic interactions in continuous media. In ref. 1 some attention was 
given to irreversible processes, like magnetic dissipation in ferromagnetics. Here 
we investigate on the basis of the modern theory of non-equilibrium thermo- 
dynamics, the relaxation phenomena which occur in dielectrics and magnetic 
materials. 

Our considerations are relevant to polarized and magnetized materials, 
saturated or nonsaturated, whether conducting or not. Such a generality is not 
necessary for the application to special problems, e.g. dielectric relaxation, but 
it throws light on similarities and distinctions between electric and magnetic 
behavior. However, we restrict our discussion to the non-relativistic theory for 
the purely elastic material. 

The literature on the interactions of electric, magnetic and elastic fields is 
extensive. In ref. 1 we present a list of important contributions. In this paper we 
found our considerations on Chu's new formulation of electro-magnetism °). 
The basis of Chu’s theory is the assumption that a material body, in rest or in 
motion, may be considered as a set of electric and/or magnetic sources, placed 
in vacuum. Chu needs two electro-magnetic vectors for the description of his 
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field. An extensive treatment of Chu’s theory with comparison to some other 
electro-magnetic theories and applications to relativistic and nonrelativistic 
problems, may be found in a book by Penfield and Haus 2). 

Chu’s formulation of Maxwell’s equations in rationalized MKS units is 


oH 5 
curl E = —Ho WE Ho 0 M, 
t 


dE А 5 
кыш. сей Е; (1) 
| [4 
Eo div E = —div (oP) + Oei; 


Ho div Н = —div (Ho 0 M), 


where E and H are the electric and magnetic field intensities, respectively, P is 
the polarization per unit mass, M is the magnetization per unit mass, o is the 
mass density, eo is the permittivity and xo the permeability of free space, о, is 
the density of charge and J is the current density. We have defined the time 
derivative P by 


rM a P) 


+ curl (o P x v), (2) 


where v is the velocity of matter. The corresponding definition for M holds. 
In fact we have 


oP = o P — v div (o P), (3) 


* 
where o P denotes the convected time derivative *). 
In the nonrelativistic theory the convected electro-magnetic vectors are 


m H — (v x & E), 


= E + (vx Н), (4) 
P =P, M*=M, 


and these are invariant with respect to a Galilei transformation, as defined by 
d д а а 
уу +, 1—1, у у, Б E r9 (5) 


where b is a constant vector. 

In matter we have to distinguish the following electric forces: E, the field 
intensity, i.e. the force on a unit charge in vacuum in a field with the prescribed 
sources, E*, the convected field intensity, D, the dielectric displacement, defined 
in the usual way, С, the effective field intensity, i.e. the work coefficient in 
the energy-balance equation. Corresponding definitions may also be given for 
the magnetic forces. Further we have to introduce f“™, the ponderomotive force 
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density. We note that none of these vectors are directly accessible to measurement. 

In sec. 2 we derive the balance equations from a principle, first stated by 
Green and Rivlin 5), that may be formulated as follows: The first law of thermo- 
dynamics, the energy-balance equation, is invariant under superposed rigid- 
body translations and rotations. This principle has proved its usefulness for the 
derivation of the known balance equations in the theory of elasticity and the 
theory of the Cosserat Continuum. In sec. 3 we obtain some constitutive equa- 
tions by the methods of thermodynamics. In sec. 4 we confine ourselves to the 
class of materials with linear relaxation equations. In sec. 5 we discuss a special 
case, the elastic dielectric, and in sec. 6 we apply the theory to linear piezo- 
electricity. 


2. Balance equations 


We consider a body that is placed in an external electro-magnetic field and 
is loaded by body and surface forces. The energy-balance equation can be 
written in the following form: 


d f uart} av4 3. [war 

— — Vy V — = 

dr fe di |] em dv J 
y y y 

= [osi о, av + [ Tv. ds—[G e & av — | c, o M, dV + 
y S V y 


+ f QB, as + ог» n. as - [ orav — [ nas [ хаз, (9 
S S V S S 


In (6) the energy-balance equation is applied to an abritrary finite part of the 
body with volume V and bounding surface S, moving in such a way that it 
always consists of the same particles. Thus d/d¢ denotes a material derivative, 
which is also indicated by a dot. In (6) U is the local internal energy per unit 
mass and W is the electro-magnetic field energy density. The mechanical body 
force is denoted by f(***? and the stress vector by T. The quantity r denotes 
the heat supply per unit mass and unit time, A is the heat flux, 9% and Q™ 
are the electric and magnetic surface vectors, respectively. X is an extra energy 
supply due to the electro-magnetic field outside of V. It contains, for instance, 
the Poynting energy flux and the dipole interaction energy from outside V. We 
note that Y is an unknown of the theory that has to be determined. In (6) the 
interaction energy is split into two parts 


fevar and [ur 


У 
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The first integral accounts for the short-range local energy, the exchange cou- 
pling, the spin-orbit coupling, near electric and magnetic dipole interactions and 
the deformation energy. The second integral comes from the long-range inter- 
action, e.g. the dipole-dipole interaction within V and the external field energy. 
Dipole-dipole interactions of particles within V with particles outside of V are 
included in fX dS. 

S 


According to Chu's theory we assume for W the expression 
W = 5 6 E? + $ uo Н?. (7) 


The condition for (6) to be invariant with respect to rigid-body transformations 
yields the following balance equations: 


0 + O Vk, = 0, (8) 
Тал + Жш + p f, C = 09% | (9) 
M, = I'eyn Mu Gar + Deas Pu Gn, (10) 


for mass, linear momentum and angular momentum, respectively. In (9) f, C0 
is given by (if о. is put equal to zero) 


p f, A = Ep," o P, + uo (0 P хня), + до (J XH"), + 


— £o Ho (0 M XE*), + uo Hr * о Mi, (11) 
which may be derived from a Maxwell stress tensor fx according to 
pfi = thar (12) 
with 
tyr = E,* Di” — 3 €o Е*? б, -+ H,* В* — $ uo Н*? by. (13) 


In (10) Г is the gyromagnetic ratio. With (8), (9), and (11) the balance equa- 
tion (6) simplifies to 


fe Ойр — | (E*— 6,9) o BAV + | (uo Ht — G0) o MAV + 
y У y 


+] Bet nav [ Ta Mi dV + [ Q, M, àS + | erav— | has, (14) 
LA У S У 5 

if Q® is put equal to zero. 
For the derivation of (8) to (13) and many other details, see ref. 1. 


3. Constitutive equations ; 
We introduce the specific entropy 7, that satisfies the Clausius-Duhan in- 
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equality 
d or h 
ке ау— [—av+ dS >0, 15 
а Је "an 5 < Wm 


where 0 is the (absolute) temperature, and the Helmholtz free energy p by 
y = 0 — 0 1. (16) 

Eliminating ғ and h from (14) and (15) we obtain the inequality 

J(e [ф + Ön + (6,0 — Е,*) P, + (Ge — po Hi") М] + 


y 


hy 0 
ВА Tum Ea [ 9: a, as + | ау «o. (17) 
S y 


We assume that y depends on the instantaneous values of 0, хк, Рк, Mis Мк, 
but is a functional of the histories of P and M. We write 


v) = V(6(t), Xk, a(t), P(t), Pur 8), M(t), Mt = 5), МЕ «(ї)), 
0<s< oo, (18) 


where хк and Mg, are defined by 


OX, oM, 
Xk a = , My, = , 
OX, OX, 
with X, the Lagrangian coordinates of the particle. 
To make any further progress we have to make some assumptions concerning 
the functional У. We introduce 


(19) 


P,{(s) = P(t — 5), My(s) = My(t — 5) (20) 
and assume that the functional derivatives 
d 
Op, P(o, Xk,as Py, Py, Мк, My, Mya) = de PP + € я), 0» (21) 
es 
d 
дм, PCO, xx, Ру, Pi, My, Mi, Mial et) = de (My + € ш"). (22) 
do exist іп the domain of histories. Then we may write 
d) Е ТЕ ЕРА Я 
i= — Хат FETU „a 
k a эз т OMe ^ Me 
+ dp, YC.. ГЁ’) + дм, V . AMi). (23) 


Because the functional ¥ has to be invariant with respect to rigid-body rota- 
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tions, it must be of the form 
YP = P(o, Aap» Cups Nu Né, Sas 5.) (24) 

with ` . 

E = Xka Мв Cag = Xka Xk, Bo А (25) 

Na = Xka Мк, S, = Xka Pr . 


From (24), (23) and (18) we derive the constitutive equations 


ov ov ow oV 
Ты = 0 Xi, (= Мв + = M: + 2 Хк,в + ср), (26) 


ФА, oN, Cus oS, 
oV 
G,O = E,* — Xka» (27) 
Р os oV P 1 ( ow ) (28) 
= = Хк, = Xka X1, , 
к = Шо lik ON, k, Р VM ka 1B s 
ov 
О, = p Хк, X1,p №. 09) 
aß 


In (26) Ту is the reversible part of Ту and in (28) R, is the nondissipative part 
of Ск“), Introducing magnetic dissipation, we can write G, as follows: 


б“ = R,— 9 (M, — erim 91 Mm), (30) 
with 7 the coefficient of “magnetic viscosity”, ешт the alternator and c, given by 
@ = $ ёа Vap i (31) 

It appears then that we also have to decompose Т, according to 
Та = Ты + Tu”, (32) 


with 7',,* the dissipative part of Т. We find for the skew-symmetric part of T,;* 
(cf. ref. 1) 


Trn” = 8.7) (Mex — &tkpq Op Ma) Mn]. (33) 


With (26) to (29) the inequality (17) simplifies to 


; , hð, 
J (Tu зы + Bet А + ел [he хм, г ar 
y 
—óy, PC.. JN 4)— bs, PC.. JŠ) > 0. (34) 
From (34) we conclude 
Толу v, 2 0, | . (35) 


e 7 (М, — (o хМ),] [M; — (®хМ)„] > 0, . Q6) 
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hy Ox 
Е*Л————>0, (37) 


ôn, V <0, (38) 
ôs, У <0. (39) 


These inequalities are important. Physically they express the fact that in irrevers- 
ible processes the dissipation is nonnegative. It will appear that especially (38) 
and (39) considerably limit the number of admissible constitutive equations. 


4. Linear relaxation 


The preceding theory is very general. To be able to make explicit calculations 
we shall confine our discussion to a smaller class of materials, the materials with 


linear relaxation equations. For this class we assume the free-energy functional 
to be quadratic іп № and $'. We write 


v = Pol, Aap» Сов, nm Sa) + 


+ л, Aap» Сав, S, 5) [N.'(s) — М„(ї)] ds + 
0 


+ [96 Ам, Cap Nas 8) 8G) — 8401 ds + 


0 


++) Јас ы) Әби) — NOTIS GA — NO ds, ds; + 
0 0 


+4] Јао.) ING) — NAOMISGA — 5,01 ds, з» + 
0 0 
Haf [96.1 s WN) — NAON Sa GO — S(O] ds, dsa + 
0 0 
++] ffs? .5 5152) SED — S408) — 50195, de. (40) 
In (40) we have 
Sagt. ..3 S45 s2) = fo. . +3 S25 51), 


Fos «5 $15 82) = fa PC .; 525 54), 
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and corresponding equalities between the other f functions. The conditions 
(38) and (39) yield | 

Ж? =f, = fog?” = fag? =Y, (a, В = 1;2, 3), (42) 


together with the dissipation inequalities 


| J Ра 90. з, 52) №) [Ng(t— 52) — Му(@)] dsi ds, <0, (43) 


о о 


о © 


f Í Fag? «5 S452) S(t—5) [Sat — s2) —S,()]ds, ds, «0. — (44) 
0 0 


We introduce the functions 


с 


Sap 9X. . $3 $15 $2) = [fa . 95 53, 52) дз, (р, q = 1, 2) (45) 


51 


and write (43) and (44) after partial integration in the form 


о © 
[ [54996-55553 Mele — 51) 524545 20, 46) 
0 0 


f [EP iss) $€—5) $6 —5) ds, 20. — (4) 
о 0 


To simplify the constitutive equations, we assume the f,,? and g,,9 func- 
tions to be independent of 0, 4,5, Cag, Nas S,. Then we derive, with the aid 
of (26) to (29), the following expressions: 


Oo Фо оро Oyo 
Ty = 0 Xia | —— Mig + —— My 4-2 ; —P,)+ 
kl 0 Xi (= k,B ON, k on 25, ) 


— 0X10 M, | 8.89100, 5) [NS (t — s) — N,(t)] ds + 
0 


со 


— o xi, Py f 8457? (0, з) [5,02 — s) — S,(t)] ds, (48) 
р | 
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d Е с 
хы җы | BaP, 8) [бй — з) — 5,0 s, 
0 


а 


G,© = E,* € 


N, e 


+ хы] 85090, з) [М — з) — №01 ds. 


0 


(49) 


(50) 


For the complete theory we have to add eq. (29) and the balance and field 
equations. There results a nonlinear theory in which the partial differential 
equations are replaced by integral-differential equations. For a discussion of 


the boundary and jump conditions, see ref. 1. 
5. The nonlinear elastic dielectric 
For the discussion of the elastic dielectric we take in (6) 
С" = М = Qo = Qo» = 0. 
It appears that for many cases of practical interest we may put 
GG = J — 0. 
The inequality (17) simplifies to 


А : h 
Јо + in- E* Р) — Tuv dar + f : 
Y Y 


0 


Putting v as a function of 0, С.в and S, and a functional of S;': 
y = V(0, Cap» Sas S, (s), 0 <5< со, 


we find the constitutive equations 


ow 
1 = эб > 
Е (2 oV js ae 
= Q Xia ———X ‚® = , 
кї 77 0 X1, T k 55, k 
ov 
E,* = Xka 


a 


йк 
dV <0. 


(51) 


(52) 


(53) 


(54) 


(55) 


(56) 


(57) 


The independent variables in these equations аге хк„ and Pp, from which Cag 
and S, are calculated. Sometimes it is expedient to change the variables and 
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to consider х, and E,* as independent. To this end we introduce the electric 
enthalpy 
e x —0wv— E,* Pro. (58) 


The inequality (53) becomes 


h 0, 
[ eG int В*Рәд— Tunlar [7,72 hay <0, — (59) 
y 
from which we derive 

òy 
Р, = — > 60 
ePi ета (60) 

ox 

Tu = © Xia (61) 

Xka 


We consider a thermodynamic process that is isothermal and closed, i.e., there 
exist two times г, and t, (>œ 1,) so that for t x: t, and for г > t, the states 
of the system are equal. Then we have the local inequality 


0X + Ex* о Р„— Ta v, « 0, (62) 


from which there results for this process 


| P, É,* dt— [2 Ты vidt < (63) 
because 
| ХЧй=у| =0. (64) 


While a discussion based upon the ¥ functional with its variables х, „ and Р, 
leads to the description of the relaxation of E,*, the y formalism yields the 
retardation of the polarization in dependence on the electric field. 


6. Application to linear piezo-electricity 
The balance equations for a linear piezo-electric body are 


до 
э T 0o = 0, (65) 


Ov, 


Тыл + 0o f, = 0o — э? (66) 
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Tu = 0, (67) 
E;* = Ex; (68) 


where оо is the density of mass for the unloaded state. In (66) and (67) we have 
omitted the terms оо E,,, Р, and оо Er, P, respectively. Both consist of two 
parts: a contribution from an external electric field and one from the residual: 
field. We take the external field uniform and constant in which case the first 
part vanishes. The other part is of the second order and has to be dropped for 
consistency. 


In the linear theory P, always occurs in the combination oo Р,. Therefore 
we transform here 


Qo Р, — P, | (69) 
and we shall define in this section P as the polarization per unit volume. We 
confine ourselves to isothermal processes and assume the enthalpy of the form 

Qo X = $ ci Cry ёми —S ets Ex € — X ёи Ex Ey + 


о © 


zl | Kalsi 52) [E(t — 51) — EX(t)] [E(t — 52) — E(t)] ds, о, (70) 
ò o 


where e;; is given by 
еу = $ (шу + иу). (71) 


In (70), сизи are the coefficients of elasticity, fı; are the piezo-electric coupling 
coefficients and =, are the susceptibilities. We have 


Eki == Eik; Ky(5, 52) = Ku sa, 51). (72) 


In (71) e; is the linear deformation tensor and и, is the displacement. From (60) 
and (61) we derive the constitutive equations 


Prt) = feig eig + ёи Е, -f | Kalsi» 52) [E(t — 52) — E,(t)| ds, dsa, (73) 
о о 
Ты = Cay Cu— ш Ei. (74) 
The dissipation condition (39) that takes here the form 
yields 


с со 4 
| [ Кы, $2) (s E(t — 2) [E(t — 51) as E,{t)\ ds, ds; <0. (76) 


о 0 
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We introduce © 
Lily, 52) = | Ky(55, 52) dss 
51 
and represent (73) as 


P, = fu; €) + & Е, — 140, 5) [E(t — 5) — Е|(т)] ds, 
0 


while (76) may be written as 


f І.(51, 52) Ex(t — s1) E(t — s2) ds, ds; « 0. 
0 0 


` The inequality (79) holds for all t. 
To interpret the meaning of (76), we write it in the form 


со © 


I(t) = J йз, ] ds, Ку($,, 52) ae E\(t— s2) [E,(t— 51) — E,{t)] 
E E ds; 


d o со 
me (sf ds, [ ds; Кү(5,, 52) Elt — $2) E(t — 54) + 
dt 0 0 


-Í ds; | ds; K,i(s;, 52) E(t — 52) A) + 
0 0 


o o 


— f às | а, Kus 52) Et — зз) Ё) <0. 
0 0 
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(77) 


(78) 


(79) 


(80) 


We now integrate I(t) over a closed cycle starting from the virgin state. We find 


t=% 


[10 dt = 7i Гк, s2) E(t — 52) Е = 51) ds, ds; 
6 о 0 


ї= — со 


о с 


-Í 6, $2) E(t — s2) E(t) ds, ds; + 
о 0 


t-o0 
t= 


—00 


о œ 


— [EO dt f [ Kus 2) Et — 52 ds, ds, <0. 
—© о 


о 


(81) 
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The first four terms at the right-hand side of (81) are equal to zero. For 


t = — oo this is trivial, for £ = +00 it is a consequence of 


E=0 fo t>t,>0, 


Ku(s,,52) — 0 for | s,— or s, — co. 


It appears that we have for this kind of processes 


| I(t) dt = — i | 1.10, 5) E(t — 5) E,(t) ds dt. 


We now go back to the inequality (63). We have 


со | со со d 
[®®&й— | Tu vidt = [би Cu Ee) dt + 


=p -00 


+ & — (E, E) dt — c — (ei; e) d + 
kt Т k ^l ijkl Р tj ORL 


— а | f Kalsi з) Ui — 5 — E01 ds, з, <0. 


-0 о 


For the closed thermodynamic process this becomes 


со со o 
[ret à— | та Uk. dr = | (0) dt « 0. 
Е) -%о — о 


(82) 


(83) 


(84) 


(85) 


We represent the constitutive equation (78) in another form. We introduce 


guls) = | Zul, p) dp 


and with this function (78) becomes 


со 


Р, = fag €g + га Ё + f uls) [E(t — 5) — Е()] ds. 
о 


(86) 


(87) 
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From (87) some other expressions for P, may be obtained: 


Py = ш еи + leu + BuO] Ei + | du) Et — з) ds, 
° 


со 


P, = frij € + Eri Е + КЪ ЕІ — 5) ds, (88) 


0 


Р, = fy; Cy + ёа Ey + ЕС Ё|() ds. 


-00 


It can easily be seen from (87) that for t — оо the susceptibility is the sym- 
metric tensor Ept 

The instantaneous susceptibility is гш + g,,(0). It is the coefficient if E 
suffers a jump. We show that g,,(0) is a symmetric tensor, negative definite, 
and that both в, and ғи + g,,(0) are positive. 

Coleman б) has given a proof of a theorem, which states that for a visco- 
elastic material, all total histories ending with given values of x, , and 6, that 
corresponding to constant values of x,,, and 0 for all times has the least free 
energy. Physically this means that the free energy tends to its equilibrium value 
and that this is a minimum. Basic for Coleman's proof is an inequality that 
corresponds to our (39). 

We may state a corresponding theorem for the function oo y. As а con- 
sequence of (75) it takes its minimum for constant histories, ending with given 
values of e,, and E,(f). Thus we have 


о о 


[| «6508 — 50 — ЕДЮ] IE — з) — E) ds, ds, <O, (89) 


о о 
for all admissible histories. We apply it to the history of fig. 1: 


E(—5)—E( s=0, (90) 
E(t—5)—a , E<S< o, 


Eft-s)|. 


0 e 
— S 


Fig. 1. History of E(t). 
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while for 0 < s < e the function is smooth. If we let e — 0, we obtain 


oo 


oo со 
| f Ky(5, 52) ак a, ds, ds; = f 2.0, S) a, a, ds = 21(0) a, a, <0, 
0 0 о 

а, constant, (91) 


the 2, (0) tensor is negative definite. It may easily be seen that оо у tends to 
its equilibrium value for t — co under constant continuation of the E history 


after a finite time. 
We now derive an expression for the free energy оо y by means of 


ox 
Qo V = Qo X + Pr Er = 00 X — 0o —— Er (92) 
ФЕ, 


We obtain 
0o V = $ суи Cry ёк: + $ ғи Ёк E, + $ 280) Е, E, + 


—4 f Kalso s) 0—5) Ets) dsi ds, 03 
о о 


If E(t — s) is а constant history we arrive at the equilibrium value of oo vy, 
o V = 5 cia ёу ём F 5 Eri Ёк Е, (94) 


and this has to be positive definite. Thus e,, is a positive tensor. Further we 
see that 
O?(0o v) 


E 95 
dE, dE, 09 


Ext + 28.00) = 


taken with E(t— s) constant. From (95) follows 
8.0) = #1«(0). (96) 
We apply again the history, given by (90). Now (93) becomes 
Co V = $ Суы ёу ёш + $ [ёш + g4(0)] Er Er — 3 Ea) а, а. (97) 


Because оо y has to be positive definite for all values of E and a, we con- 
clude again that g,,(0) is negative definite, while ¢,, + g,,(0) is positive. Note 
that for sufficiently small values of lal. бо y, according to (97), is smaller than 
its equilibrium value (94). However, this does not violate the well-known law 
of physics, because og y obtains its minimum for constant histories of P. 
Thus in the expression for ооу, E, is a “wrong” variable. For the process 
(90) the polarization Р, becomes for t — oo 


Py — frig €) + [гы + g4(0)] Е, — 2„(0) a (98) 
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which is smaller for sufficiently small lal than the equilibrium value of P, 
corresponding to (94). 

But even expressed in the wrong variable E,, оо y has to remain positive, 
because it is here the internal energy of the body, loaded by mechanical and 
electric forces. If the excess energy with respect to the energy of the natural 
state (е, = E, = P, = 0), which we have taken equal to zero, changes sign, 
there may be loss of stability. We exclude this from our discussion. 

We now derive some other properties of the tensor g,,(t). We write the 
integral (85) as 


a со 


со 
о = |а f гш) BO E — s) ds < 0, (99) 
-%0 - 0 
and apply it to some closed thermodynamic processes. We take (cf. fig. 2) 
E = (0, 0, 0), t<, tS 
( ) 1 2 (100) 
Е = (а,, a2, аз), ty « t « ts. 
Elt) 
a 
ü t2 —— t 
Fig. 2. Path of E,(t). 
For this path we find 
| 4 | ds dus) а, 66 — 0 —0¢—1)] Es) = 
= о 0 
со 12—11 
= a.a | ёз) 0—0 —з)] = —а а, | ваб) дз 
о 0 
= —ак d, [grtz — t1) — &м(0)] < 0. (101) 
As t; — t, has any positive value, we have | 
Erit) ак а 2 ZrO) ак а, t>0. (102) 


In (101), H(t — s) is the Heaviside step function and ó(t — 1,) is Dirac's delta 
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Е) 


app----—--- 


to 


— { 


Fig. 3. Path of E,(t). 


function. Because every process may be built up of elementary processes of 
the kind of (100), (102) is a general statement. 
Another inequality may be found with the aid of (79). We apply it to 
E=(0,0,0) , t<t%, 


(103) 
E = (Q, а, аз), 12 to, 


(cf. fig. 3), which is not a closed process. We note that (79) holds for any 
thermodynamically admissible process. We find 


f [dsi а з зз) а, a òl — to — 51) 8l — to — з) = 
о о 
= ака, Lilt c to, t— to) < 0. (104) 


If we take t = to, we obtain from (104) 
—L4(0, 0) a, a, = &y(0) а, a, 2 0, (105) 


from which follows that g,,(0) is a positive definite tensor. 
It is not possible to derive more conclusions from (79) or its integrated form 
(85). For a closed thermodynamic process (95) may also be written as 


oo со 


| at f ds g(s) E,(t) E(t — 5) = 
0 


-%0 


= far авыз) ÈO Ё). (106) 


To prove complete monotonicity for the one-dimensional retardation function 
g(s) we need 


Í | g(s + t) a(s) a(t) ds dt <0, (107) 
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for all continuous a, defined on [p,q] (cf. ref. 7) and this does not follow 
from the inequalities we have derived. The physical meaning of (107) is that 
the work done on retraced paths is increased by delay. 

It is also impossible to conclude that Onsager's relations 


Ex(5) = gu(s) (108) 
hold. It is sufficient for (108) that 


Ky(5,, 52) = Ку(5›, 51), (109) 


but this is not required in the general theory. It may be proved that (108) holds 
if the work done on every closed path starting from the virgin state is invariant 
under time reversal 9). 

The theory may be applied to the investigation of the relaxation spectrum of 
dielectrics. Looking for harmonic solutions, we put 


Ты = T, exp (iot), Р, = P, exp (iot), etc. (110) 
For the amplitudes we now find from (74) and (88) 
Ta = Cia € — fua E, (111) 


P, = fij Ču + & Ё, + io Ё, Е exp (—iws) ds. (112) 
0 


With the aid of the Laplace-transform techniques, we may also state a principle 
that formulates a one-to-one correspondence of the Laplace transforms of the 
quantities in this theory with the corresponding ones of the classical theory. 
The “material constants" here are functions of the Laplace parameter. This 
technique is well known in the theory of visco-elasticity. 
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Abstract 


Uniformly valid (with respect to the independent variable) asymptotic 
approximations to the oblate spheroidal wave functions are constructed 
for large values of the wavenumber c. The emphasis is on qualitative 
accuracy (such as might be useful to the physicist), rather than on 
efficient algorithms for very accurate numerical computation, and the 
error factor for most of the approximations is 1 + O(1/c) as ct œ. 


1. Introduction 


Spheroidal wave functions appear to have been investigated originally by 
Niven +), although the spheroidal wave equation is intrinsically similar to 
Laplace's tidal equation for a global ocean, for the solution of which Laplace ?) 
developed his method of infinite continued fractions. Flammer °) gives a 
historical survey of spheroidal wave functions between 1880 and 1957, and I 
notice here only that Bouwkamp made many contributions, including a major 
improvement on Laplace's method, in a series of papers beginning with his 
doctoral dissertation ^). My own interest in the subject stems originally from 
that dissertation and was recently revived in connection with Laplace's tidal 
equation. It therefore gives me special pleasure to contribute this paper to 
Professor Bouwkamp's anniversary issue. 

Asymptotic approximations to the regular prolate functions have attracted 
some attention in recent years (Slepian 5), Streifer $), Cloizeaux and Mehta ?)); 
however, I know of no asymptotic results for the oblate functions other than 
those given in Flammer's monograph, which hold only in rather limited domains 
(see secs 2 and 7 below). In the following analysis, I seek uniformly valid (with 
respect to the independent variable) asymptotic approximations to the oblate 
functions for large values of the parametric wavenumber c, moderate values of 
the azimuthal wavenumber m, and all values of the zero-crossing index п. My 


*) This work was partially supported by the Atmospheric Sciences Section of the National 
Science Foundation (NSF Grant GA-35396X1) and by the Office of Naval Research. 
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` emphasis is on qualitative accuracy, such as might be of interest to a physicist, 
rather than on efficient algorithms for very accurate numerical computation. 
I give numerical comparisons with tabulated values of the eigenvalues and the 
radial functions in secs 2 and 3 but have been unable to find adequate tabula- 
tions to permit comparisons for the angular functions in secs 4—7. 
The angular functions for the solution of the wave equation in oblate 
spheroidal coordinates are determined by the differential equation 
42 A+ c? 7? 1 —m? 
E 2—1. acum] (1 — 7)? S, C-ic, 2) = 0 
d? 1—9 (1—12)? 
Cl<7<1), (0 
the boundary conditions 
Sinn = ога pu ?у"?] (n =E 1), (2) 
and the normalizing conditions 


Р"(0) 0 еуеп 
S, (ic, 0) = » Smn'(—ic, 0) = ( =п—т ‚ Gab) 
0 P™(0) odd 


where c is a real number, m and п are non-negative integers, n >т, 
A = Ann(—ic) is an eigenvalue, the primes in (3b) imply differentiation with 
respect to 7, and / is the number of zeros of S,,, in —1 < N< 1. S,, is an 
even/odd function of 7 for 1 even/odd; accordingly, it suffices to consider 
0 <7 < 1. The eigenvalue 4 may be either positive or negative, but it may 
be established from the (essentially Sturm-Liouville) form of the eigenvalue 
problem posed by (1)-(3) that 


A+ е? == (1 — о?) с? =f? c? >0, (4а) 
where 
02 = —A/c? = 1 — f. (4b) 
The corresponding radial functions, 
Ran? (—ic, i£) = Км (—ic, ig) + iR, (—ic, i£) (0 < ё < со), (5) 
are determined by the differential equation 
E с2 ё2 А m-—1 


аврат? аат (P + DH? Runic, if) = 0 


(0<#< o) (6) 


and the normalizing conditions 


Romn ®® ~ (с E) exp (+ і[сё— (п + 1) (21) (To). (7) 
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Ryn??? are real in (0, оо); accordingly, it suffices to determine Rmn‘? from 
(6) and (7), after which R,,0:?*9?) may be determined from (5). ` 

The radial function R,,‘(—ic, i£) is proportional to S,,(—ic, i); see 
ref. 3, eq. (4.2.2). It therefore follows from (3) that 


Кы (—ic, 0) =0 (1 odd), | (8а) 
Rmn (—ic, 0) =0 (1 even), (8b) 


where the prime implies differentiation with respect to &. These last conditions 
will be satisfied by the solution of (6) and (7) if 4 is predetermined by the solu- 
tion of the eigenvalue problem posed by (1)-(3); alternatively, 4 can be deter- 
mined by solving the eigenvalue problem posed by (6)-(8). We follow the latter 
approach in secs 2 and 3. 

The preceding definitions follow Flammer?) and Abramowitz and Stegun 8), 
The subsequent notation also follows Abramowitz and Stegun for other higher 
transcendental functions (except in the definition of the parameters and the 
separation thereof for the elliptic integrals). An error factor of 1 + О(1/с) 
is implicit for all asymptotic approximations except as explicitly noted (higher 
approximations for the radial functions are considered in secs 2 and 3). 

It is expedient to list here the following auxiliary functions, which enter the 
subsequent analysis at several points: 


$(0, В) = EO, В) — (1 — В) FO, p) <8 <1) (9a) 
= В Е(, V) (В 21), (9b) 
$(8) = $a. В) = В ECB), (10) 
219, В) = В LF, |118) — E, |5|/8)) (821), (11) 
and 
xB) = x, В) = B ІК(а|/8) — (|9110), (12) 


where E(—, —) and F(—, —) are elliptic integrals of the second and first kind, 
respectively, and E(—) and K(—) are the complete integrals. 


2. Radial functions: c f оо with m, n fixed 


The differential equation (6) has neither turning points nor singularities in 
0<é< o if A< 0, as it is for сї © with m and n fixed. This, together 
with (7), suggests an asymptotic solution of the form (cf. Erdélyi 11), 84.2) 


ASYMPTOTIC APPROXIMATIONS FOR OBLATE SPHEROIDAL WAVE FUNCTIONS 143* 


Ryn (ic, i£) = c7! (E! + 1) 4? exp ( У, Ge) 5) — iQ + 1) n/2), (13) 


s=0 
h O> (itos ;j (14) 
er (5-— со, . 
where б ( 5 2i 
Substituting (13) into (6) and equating coefficients of like powers of c yields 
A = (E + 1)-'® (E v0 (15а) 
fi = UD fo’; (15b) 
and 
5—1 
2 fo fs thor” + A fear’ + 82(т?—1)(#°-„1)-%==0 (sz2. (150) 
т=1 


We remark that f; (E) is even/odd for even/odd s. 
Invoking (8) and /,'(0) = 0 if s is odd, we obtain the eigenvalue equation 


со 


E C3'e-"f(0) = c (6) = тл2 (0<8<1), (16) 
r=0 
where T= к (: a (17) 
n odd 
The corresponding boundary values are 
-1 
Ran“ (ie, 0) = F( ; (18a) 
QG»X(. i = 
Rmn? (—ic, 0) e- (бу (18b) 
Rin” (—ic, 0) Y: н , (1да) 
ЏЕ(В) 
Ran? (—ic, 0) == с > (19b) 


0 


where the primes imply differentiation with respect to &, the upper/lower alter- 
natives correspond to even/odd /, 


E(B) = exp ( LOY c” fort 0) (20а) 


со —1/2 
= ( С) сао) , (20b) 


144* JOHN W. MILES 


ААА ————————————————————— 


and (20b) follows from (20а) with the aid of ‘the Wronskian relation 
RU Кы — Rmn Rast ert (£? + 1)71. (21) 


It follows from (16) that the eigenvalues and eigenfunctions are asymptotically 
grouped in pairs (the partners of which differ by terms that are exponentially 
small) as c f œ with m and n fixed (cf. ref. 3, § 8.2). It appears from the sub- 
sequent results (see sec. 3) that this confluence holds if and only ifn < c and 
not for either п = O(c) or n > c. 

Integrating (15) and invoking (14), we obtain 


JAE) = E (E? + 1)!? (E? + 02) 072 — ф [arctan (£/a), B] + Ф(8), (22а) 
where ¢(—, —) and $(—) are defined by (9a) and (10), respectively, 
fi = Flog [(£? + DIE? + «?)у], З (22b) 


fA) = В EE + 1)7 1 (E H o2)79P + 
—4 pt /@? + 1)- 3/2 (t? + «?у-5? t? dt + 
Ы 


45 0т2— 1) fe + 1)-3/2 (t£? + 02)-4/? dt, (220) 
[i 
and 
2/8) = —U-i fs ] 0 0,3) У f UA (O- (fo (dt (52:3). 
т=2 4 
(22d) 


Substituting (22a-c) into (16) and (20) yields the approximations 


Ф(В) = Ф(8) — c7? (Tom? — 0/28?] IK) — E(8)] + 
+ [(8 — 382)/24 82] K(B) — [(8 — 782/24 о? 8°] E(B)) + О(с-*) (23) 
and А 
E(B) = a-!? {1 +4077 a74 [82 — 2 (m? — 1) 021+ O(c~*)}. (24) 


Table I compares numerical values of 4 calculated from (16) and (23) for 
с = 10 with the values from ref. 10 and with the approximations provided by 
(25), (37), (42) and (55) below; (42a) is identical with the first approximation 
obtained by setting Ø = ¢ in (16). Tables II and III give corresponding com- 
parisons for the boundary values calculated from (18), (19) and (24). Further 
comparisons reveal that the corresponding approximations for c = 5 are 
satisfactory only for n — m — 0, 1, whereas those for c — 20 are in agreement 
with the tabulated values to at least three (typically four) significant figures for 
m = 0(1)2 and n — m = 0(1)8; however, the approximations ultimately de- 
teriorate with increasing п for fixed c. 
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Letting £? | 0 in (16) and (23) yields 
Ae? 270+ By +47 В c7 + О(с-), (25a) 
Во = 5 (т? — 1—т?), (25b) 
which agrees with ref. 3, eq. (8.2.11). Letting 2? | 0 in (13) et seq. yields 
RO) (—ic, i£) = 
=c71(& + 1)7 12 exp ( [c & + t arccot £ — (n + 1) 2/2] + y cms £e). (26) 


s=1 
where 


AiG) = 4 (e + i Bo £) (1 + er © (27a) 


and 


f4(£) = + {2° + Bo (22 — 1) +ivé [% Bo (E + 3)—2]} (1 + &)-?. (270) 


The expansion (26) appears to be simpler than, albeit equivalent to, Flammer’s 
expansion in irregular Laguerre polynomials (ref. 3, eq. (8.2.46). 


3. Radial functions: Weber-function model 


The differential equation (6) has turning points at £? = —g? that are signif- 
icant for its asymptotic solution if either A > c or A = O(c), corresponding 
to n 2» с or n= O(c). Only one of these turning points (& = le) is directly 
significant for A 2» c, whereas both are significant for 4 = O(c). An Airy- 
function model (see below) suffices for a single (isolated) turning point, whereas 
a Weber-function model is necessary for a description of both turning points. 
A Weber-function model that is uniformly valid for —оо < 4< оо and 
0 <&< oo may be constructed through the transformation 


dz 2 &2 + a? 
®—а)|— | = с? Я 28 
е a) PY i 
z(0) — 0, (28b) 
and 
w(z) = [c (E? + 1) z (£)P? R,, 9? (—ie, i£), (29) 
where a is determined by the requirement that 2? = —a? map on z? = 4a. 
Transforming (6) yields 
d? 
E + 422 —а + re) | w(z) = 0, (30) 
dz? 


where 


r= (m — 1) KE + D zT? + G) ^ (ey), (31) 


146* JOHN W. MILES 


Neglecting r(z), which is uniformly O(1/c), reduces (30) to Weber’s (parabolic- 
cylinder) equation. We proceed to show that the required solution, as deter- 
mined by (7) and (8), is 


even 
w(2) = exp [i ( « —4 22] E(a, 2) (i ) (32) 
odd 
where 
x(a) = arctan {[1 + exp (27 a)]!/? — exp (x a)) (33) 


and E(a, z) is the complex Weber function defined in ref. 8, eq. (19.17.6). 
Integrating (28) and choosing a such that 22 = —a? maps on z? = 4a, yields 


x(B) 
—$(a) 
a arccos (3 a7? z) — 4z (4a — z?)!? = c x [arccos (£/|a], 8] = $ c (—2)?^2, 


O< E< |, a0), (35а) 


pat аю | | (a? 50), (34) 


4z (z? — 4а)!/? — aarccosh (4 a7 1? z) = 

= c {E71 (E? + 1)!? (2 + a2)! — ф [arccos (|а]/®), В] 

= c [folé) — 4(8) 2 $cz?^, (62 |0 «20 (35b) 
and 

4z (22 — 4a)!/? — a arcsinh (4 |a|-?? 2) = 

= c {E (E? + 1)? (£? + a?)7?? — ф [arctan (£/o), 8]) 

= c [fo(5) — ф(8)] (6-0, a«0) (350 


where ф and x are defined by (9)-(12), fo(35c) = fo(22a), and the auxiliary 
variable z anticipates the subsequent approximation (45) for a 1 oo. 

Substituting (32) into (28), letting 5 and z 1 оо, and invoking the asymptotic 
approximations 


42? — 4a [1 + log G?/|aD] ~ с [E — #00)1 (36a) 


and | 
w(z)  (42)71/'?ехр {i [022 — alogz + 4 arg Г(% + ia)— іл #]}, (36b) 


which follow from (35) and ref. 8, eq. (19.21.1), we find that (7) is satisfied if 
and only if 


c (8) — Ka) F x(a) = (n + Y л/2 ( m) (37) 


where 
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h(a) = —h(—a) = } arg Г + ia) — 4a (log |а| — 1) (38a) 
= —4a (log |a| + 0-964) + О(а2) — (a— 0) (38b) 
~ (48 a)** + O(a^?) (a— oo). (38c) 


The corresponding boundary values are 


2 
Ry, ie, 0) = c7 |a|- 2 H(a) [1 + exp (2x a)]F ч“ k s ”) 


porn 46 
( odd ) 63) 


0+i 
Rig: —ic, 0) = |a|'? H7* [1 T exp (2 z a)}**/4 
( ) | | [ PQxa)] ( Do 


е 40 
ua! (40) 


where 
H(a) = |а| |Г@ + Zid rG + zia”? (41a) 
= 1:446 |a|*/* [1 — 0-916 a? + O(a*)] (а 0) (41b) 
~ 1+ (32 а2)- + О(а-*) (a>+0). (410) 


The functions /(a) and (а) are plotted in figs 1 and 2. Numerical values of A 
calculated from (37) and from the limiting approximations (cf. (16)) 
c E(P) = v 2/2 (a — —oo) (42a) 
and 


сф(8) = (п + 3) (0/2) (а ©) (42b) 


are compared with the ref. 10 values in table I. The boundary values calculated 
from (39) and (40) are compared with the ref. 9 values in tables II and III. 
The transition values of c obtained by setting 4 = a = 0 in (37), 


с= (" is | л[2 = Cmn” ( en (43) 
n+4 odd 


are compared with the interpolated ref. 10 values in table V. We note that 
letting f 7 оо in (42) yields 


A ~ n(n + 1) —4c? + OG, c*[n?) (п/с f оо), (44) 
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02 


% 0-2 04 06 08 10 


—— q 


Fig. 1. The function h(a), as given by (38). The approximation (38c) is in error by less than 
1% for a» 1. 


12 


0 02 04 06 0-8 ‚10 


— а 


Fig. 2. The function H(a), as given by (41). The approximation (41c) is in error by less than 
0:395 for a 1. 


which may be compared with the equivalent limit c — n (n + 1) as c/n} O 
(ref. 3, § 3.1.2). 


The limits a— + oo 


Letting a— —оо (cf oo with A < 0) in the preceding formulation yields 
(13) within 1 + O(1/c). Letting a 1 oo (c 1 © with å 2 0) and invoking the 
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Airy-function approximations given by ref. 8, 19.20 yields (after some 
reduction) (42b) and 

Rin (—ic, i£) = 

ЖЕЗ лі? с 5/6 [2 + 1) (22 + a?)z]- 1/4 [Ai (—с2/3 2) — і Ві (—c?/8 z)] (45) 
where 2 is defined by (35a, b). This approximation also may be obtained 
directly, rather than through (28), by Langer's method. 


Letting c f oo with £ — lo] 2 0 in (45) yields the outer and inner approxi- 
mations 


Ry, (Hic, i£) ~ 7* (E + 1)7 5 (E2 + 02) Ч exp {i [efo(é) + 


— (n + 1) xJ2]) [c (&— |a) > 1] (46a) 
~ —ic7! (E? + 1)7 ^ (|02 — £2)7 !/* exp (c x [arccos (£/|o], 81) 
[c «| — 2) > 11, (46b) 


where fọ and y are defined by (35b) and (11). We remark that (46a) also holds 
for a — —оо if f,(£) is defined by (35c) and then is equivalent to (13) within 
1 + O(1/c). We also remark that R,,,“ is exponentially small in c («| —ё) 1. 
The boundary values given by (46b) or, equivalently, by letting а 7 оо in (39) 
and (40), are 


| ехр (—с even 
Run? (—ic, 0) = с? lo|- i ( PC Do i exp (c 2) (: ) (47) 
0 odd 
and 


0 
Ryn (—ic, 0) = |о|]!? ( 


even 
+ iexp eD) ( ) (48) 
ехр (—с у) о 


where y = x(f) is given by (12). 


Second approximation: А = O(c) 


The formal asymptotic expansion for w(z), the dominant term of which is 
given by (32), may be obtained by regarding —r(z) w(z) in (30) as a forcing 
function and solving by iteration (using variation of parameters to construct 
particular solutions); however, the resulting integrals are intractable unless 
either n >c or A= O(c), and it then is more direct to specialize before 
proceeding to the higher approximations. The complete asymptotic expansion 
for —А > c is given by (13), which is uniformly valid for 0 < ë< oo. If 
either 4 = O(c) ог A>>c, (13) may be regarded as an outer expansion in 
с(&— la) >> 1 and matched to an inner expansion that is based on Weber's 
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equation if A = O(c) or Airy's equation if 2 > c (cf. (45)-(48)). We consider 

only А = O(c), this being the domain in which the preceding (first) approx- 

imations are least satisfactory. | 
Expanding (13) and (22) in 1/с with £ = O(1) and 


ао ziciAzm—ko?c-O(l (49) 


yields the outer approximation 


Ry, (—ic, i£) = R(E) exp [i o(6)], (50a) 
where 
RCE) = c7! £1? (E2 + 1) V^ [1 + 4 ao 07 #52 --O(c?)], (500) 
о(Ё) = с (2? + 1)!? + a arcesch £ — (n + 1) (2/2) + 
T c7! [8002 — $) 72 — a] (E? + 1)?? + O77), (500) 
and 
а = а + a ct, (51а) 
a, = $— т? — 4a)’. (51b) 


The notation of (51) anticipates the subsequent role of а in the inner expansion. 
We define the inner domain by & = O(a) and pose the inner approximation 
in the form (29) with (28) replaced by 


z = Qo)? Е + c^! z (5) + O(c7?), (52) 


in which the first term is the limiting value of z obtained by letting «—0 
in (28) with £ = O(a). Substituting (29) and (52) into (6), we find that choosing 


z) == —2-—7/? c? &(02 + 2&7) (53) 


reduces the differential equation for w to 


[= +4z?—a+ СЕ] w(z) = 0, (54) 
dz? 


where a is given by (51). The boundary condition (8) implies that the solution 


of (54) is again of the form (32), and the matching requirement implied by (50) 
yields the eigenvalue equation 


c + 4a log 8с — } arg IG + ia) F xla) + (&—а,) c^! + 0(c7?) = 
x 2 pe 55 
= (n-F 3) (2) (o) (55) 


The corresponding boundary values are given by 
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Run (—ie, 0) = [RHS(39)] |1 — (m — §) 471714 [1 + О(с-2)] (59 
and 
Ryn" (ie, 0) = [RHS(40)] |1 — (n? — §) 2-0% [1 + OD] (57) 


where RHS(39, 40) is obtained by substituting a from (51) into the right-hand 
side of (39, 40). 

Numerical values of A calculated from (55) in the transition regions are com- 
pared with the previous approximations and with the ref. 10 values in table I. 
The corresponding transition values of c (cf. (43)), 


c= Gust + 
+ {4 (m? — #) [log Can" + 2:043 + oO (x/2)] "ә! 2} Co 4 T О((с„„*)7 3), 
(58) 


are compared with the ref. 10 values іп table У. The differences in the last 
significant figure for n — m > 0 are no greater than the interpolation error 
except for m = 2 and n = 4, where the error is still less than 197. The boundary 
values obtained from (56) and (57) are compared with those obtained from (39) 
and (40), and with the ref. 9 values, in the transition neighbourhoods for 
c — 10 in table IV. The corresponding comparison for c — 5 reveals that (56) 
and (57) are not significantly more accurate than (39) and (40) for such small c. 


4. Angular functions: 4 >с 


The differential equation (1) has a regular singularity at the end point 2] = 1. 
but it has no turning points, and may be modelled by Bessel’s equation, in 
0 x n « Lif A c (02 < 0, B > 1). Introducing the Liouville transformation 


y; 2 «2\1? 
via) = (5) a dress m (59 
and 
u(y) == [a = n?) y (1^ S, Cic, n), (60) 


where ¢ is defined by (9b), and letting с] о with y = O(1) and f >1, we 
obtain 
d2 i—m 
Е ++ + оа) |20) — 0. (6) 
ау? у? 


The solution of (61) is a linear combination of у!/? J,(c y) and у!/? Ү,„(с y). 
Excluding the latter solution by invoking (2), we obtain 
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Sanic, 1) = A (1 — 12) * (02 — a2)7 "4 Ev yy ? Jue y) (622) 
~ A (1 — n?) (2 — 02) 1 cos [c y — (m + 4) 2/2] 

(cy ©), (62) 
where А = A,,(c) is a constant. Invoking (3) yields the eigenvalue equation 
(42b) and 

А = |a|? (—)!'? Р,"(0) (1 even) (63a) 
= |«|- 2/2 с (—)9- 10/2 p,"(0) (1 odd). (63b) 


5. Angular functions: c & —4 « с? 


The differential equation (1) has turning points at у= « if 1<0 
O<a<1, 0<f< 1) It may be modelled by Bessel's equation in the 
outer domain с (у — о) > 1 and by Airy's equation in the inner domain 
c(1—15) 1 if c« —A « с2, such that the turning point at n = о is not 
contiguous to either 7 = 0 or7 = 1. 

The outer approximation has the same form as (62), but (59) must be 
replaced by 

уб) = Albo, B), Oo = arcsin [(1 —57)7/8] — (x <1), (64а) 


where ф is given by (9a). Replacing A by A% in (62) then yields 
S, C ic, n) = AX) (1 — 22) 1^ (1% — 02) 04 G n c y)! ? Jale y) 
[c (7 — а) > 1] (65а) 
e A (1—22) U^ (n? — a?) 1/4 cos [с (o, В)— (т + 3) 212] 
(cyto). (65) 


The inner approximation тау be constructed through ће Langer trans- 
formation 


$42. :2N 1/2 
&(—z2?- | (5 = E ) dt = ф(о) — $(0,, o), 0, = arcsin (74/0) 


n 


O<n<a<1),  (66a,b) 


n {2 —g2\ 1/2 | 
sen - | (=) dt = 4(6)— 40,6) (<<), (67) 


апа 
w(z) = [(1 — 2?) z'(T ^ S, Cie, 7), (68) 
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where ф(—) and $(—, —) are given by (10) and (9a). Transforming (1) and 
letting c 1 œ with z = O(1) and 0 < f < 1, we obtain 


Е +e? z+ oc | w(z) = 0. (69) 


The solution of (69) is a linear combination of the Airy functions Ai (—c?/? 2) 
and Ві (—c?? 2). Excluding the latter solution in anticipation of the outer 
matching requirement, we obtain (cf. (46)) 


Sinn (—ie, п) = s! c8 4 [(1 — 2) (02 — ауур] 44 Ai 02/3 2) 
с) 21) (70) 


~ AO (1—12) 1/4 (y? — 02) 0/4 sin {с [ф(В)— (80, 8)] + 47} 
(c? zf оо), (705) 


~ ZA (1 — 22) 10 (à? — 12) exp {—с [9(0) — 9(8,, 0)]) 
(c?/3 2 | —oo), (70c) 
where A“) is a constant. Matching both the amplitude and phase of (65b) and 


(70b) and then requiring (70c) to satisfy (3), we obtain the eigenvalue equation 
(42а) and 


AO? = (67-102 40 = 24 exp [е d(2)] (1) 


where А is given by (65) and т by (17). 


6. Angular functions: 4 = O(c) 


The form of the outer approximation (65) remains valid for о? = О(1/0), 
but the inner approximation then must be based on Weber's, rather than Airy’s, 
equation in consequence of the interaction between the two turning points at 
7] = + a (which are real/imaginary for 4 s 0). 

Letting «? | 0 in (64) and (65) and neglecting terms that are O(1/c) relative 
to unity, we obtain 


y = (1 — n?) —4 о? log {[1 + (1 — 77)! ]/n} + O(a*) (72) 
and 
Sí, (ic, n) = AO (л c|) In(cy) (0? > 02), (73) 


where А is to be determined. 
The inner approximation may be constructed by analogy with Meixner's 
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asymptotic approximation for the prolate angular functions. Introducing (cf. 
(49) and (52)) 
x = (2c)? y, (74a) 


а=—}со? (74b) 


and letting c f oo with x, a = O(1), we obtain 


d? 
E + i x + а + O(c J Smn(—ic, 1) m 0. (75) 
x 
The required solutions of (75) subject to (3), are 
Sma (ic, 1) = FAM [W(—a, —x) + CO W(—a, x), (76) 
where 
AG = 21 p,n(0) |g|^ H-1 (1 even) (77a) 
= P,"(0) c72 |д|7'* Н (I odd), (TTb) 


W (a, x) is the Weber function of ref. 8, 19.16 ff, and H(a) is given by (41). 
Matching (73) and (76) in the overlapping domain defined by 7 <1, 
x |а, су» 1, in which 


S, ~ AQO 5717? cos {с [1 — 2 n? —4 a? log (2/n)] — (т + 4) 2/2} (78а) 
апа 


Sin? ~ AO [1 + ехр(—2х а)]!'/® x7 !? cos (3x? -+ alog x + 6 — іл), 
(78b) 
where 
ô = — arg I'(3 + ia) + (C) Gr — x) (79) 


and x is given by (33), we obtain the eigenvalue equation (cf. (55)) 


+ kalogc —$ arg ГО + ia) TF sa) = (n+ 3) 2/2 Uu) (80) 
о 


апа 


^ NU2 
A = (20-44 [1 + exp C2. d)]!/* AO PERS ( a) S 


7. Angular functions: 4 + c? = O(c) 


Flammer (ref. 3, $ 8.2), following Meixner, gives a systematic asymptotic 
expansion of S,,(—ic, n) in Laguerre polynomials. The leading terms in the 
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expansion and the corresponding eigenvalue equation are given by (ref. 3, 
eqs (8.2.9) and (8.2.11); cf. (25)) 


Ac c = 2тс— 5 (12 + 1—m?) + O(c^?) (82) 
and | 


S, C (—ic, n) = Aon ü 2 aye L,™ [2c (1 иш 7)] exp [—с (1 "ж 1)], 
(83) 


1/2 even 
v= ( l ) (84) 
(1— 1)/2 ода 


where 


L,“™ is a Laguerre polynomial, and 49"" (which Flammer does not determine 
explicitly) is implicitly determined by (3). In fact, although Flammer does not 
so state, (83) and the higher approximations given by ref. 3, eq. (8.2.9) are 
outer approximations and are valid only for 1 — n = O(1/c) and 4 + с? = O(c) 
or, equivalently, 8? = O(1/c). 

The counterpart of ref. 3, eq. (8.2.9) for c (1 — 5) > 1, in which domain 
(1) has no turning points if 5? — O(1/c), may be obtained by a straight- 
forward expansion of the form (cf. (26)) 


Sian (ie, N) = Cus (SQ) + (CF 5—2), . (85 
where 
Cmn = $ P,"(0)/S(0) (1 еуеп) (86a) 
= }Р,""(0)/5'(0) (оаа) (86b) 
and 


500) = @ — 97)? exp ( 7 — т arctanh у + x одго), (87) 


5=1 


in which f,(7) may be obtained by substituting (87) into (1) and equating powers 
of c. Retaining only fo, yields 


S) = (1 — 08-00 (1 +) #292 exp (c y). (88) 


Matching (88) to (83) in the overlapping domain defined by 1/c &'1— у « 1 
and 7 > 1/с, we obtain 


Ao" = 27° v! C (c) " exp c. (89) 


TABLE I 


Amn(—ic), as determined from (16) and (23), (25), (37), (42), (55), and the eleven-place tables of Stuckey and Layton 10). The 
eigenvalues calculated from (16) and (23) for c = 20 agree with the ref. 10 values to at least three (typically four) significant 
figures for m = 0(1)2 and n — m = 0(1)8 


x9ST 


00 —1608 —1561  —1553 —3610 — —3605 —3610 
1 —1605  —1559 —1553 — 3610 — —3605 —361:0 
2 —245' '—192  —080 — 241 2852  —2847 —2852 
3 006 . 0°52 1:52 0-10 —2852 2847  —2852 
4 8-63 9-06 8-75 8-60 —2140  —2134 — 213-8 
3 18-09 18-47 18-40 ~ —1533  —1568  —1483  —1497  —1529  —1462 —21140  —2134  —2138 
6 29:92 3028 3022 — 257 — 3:10 101 2:59 10-63 —1479 —1413 —1472 
7 43-81 4415 4410 — 11-46 — 1191 12-23 11-46 10-63 —1479 —413 —147-2 
8 59:74 60:07 60-02 — 29-78 — 879 — 872 — 856 


WON с чл шоо е 


2 —2847 —283-1 
3 —283-1 —2847 —2831 
4 —2118 | —134 —2117 
5 —2118  —2134  —2117 , 
6 —1457 —1473 —1450 
7 —1457  —1473  —1450 
8 —855  —874  —834 
9 —855  —872  — 834 


—348*) — 271 


*) (55) gives —33-6. 
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TABLE П 


Кы (—ic, 0) and R,,,G?'(—ic, 0), as determined from (18) and (19) using (24), from (39) and (40), and from the eighteen-place 
ref. 9 tables 


Вы%(—120, 0) 


RU 5, 0) Ren (-110,0) . Rati 10, 0) 


ref. 9 (39) ref. 9 (18a) (39) ref.9 — ^ (192) (40) (182) (39) 
0 2287—01 2280—01 | 1058—01 1058—01 1057—01 0 0 0| 5134—02 5134—02 5133—02 
0 0 0 0 0| 9454—01 9445—01 9461—01 0 0 0 
3303—01 3310—01 | 1230—01 1228—01 1227—01 0 0 O | 5447—02 5447—02 5446—02 
0 0 0 0 0| 8132—01 8140—01 8147—01 0 0 0 


6653—02 6835—02 | 1:662--01  1:683—01 1659—01 0 0 0| 5856—02 5856—02 5855—02 
о 0 0 0 0| 5913—01 5923—01 5928—01 0 0 0 


1011—06 | 1-070—03 — 1:082—03 0 — 0| 9-508—02 — 9-502—02 


о ол шын о 


- 


9:952—07 


1 2780—01 2804—01 | 1128—01 1128—01 1129—01 0 0 0| 5279—02 5279—02 5280—02 
2 0 0 0 0 0 | 8866—01 8:867—01 8857—01 0 0 0 
3 2271—01 2195—01 | 1379—01 1375—01 1383—01 0 0 0| 5634—02 5634—02 5635—02 
4 0 0 0 0 0| 7247—01 7210—01 7226—01 0 0 0 
5 1:550—02 1-480—02 | 1963—01 — 1964—01 0. — 0| 6113—02 6113—02 6115—02 
6 0 0 0 — 0] 4157—01 — 4119—01 0 0 0 
1 1124—07 1094—07 | 2:377—04 — 2:326—04 0 ^ 0| 1134—01 — 1135—01 


2 3346—01 3310—01 | 1217—01 1227—01 0 0| 5438—02 5438—02 5446—02 
3 0 0 0 0 0} 8220—01 8:147—01 0 0 0 
4 8983—02 6835—02 | 1621—01 1668—01 1:659 —01 0 0| 5844—02 5844—02 5855-02 
5 0 0 0 0 0 | 6099—01 5:928 — 01 0 0 0 
6 3-493—03 2752—03 | 1-637—01 — 1:532—01 0 O | 6413—02 6413—02 6430—02 
7 0 0 0 — 0| 2391—01 2:218 —01 0 0 0 
12 1-256—08 1090—08 | 5174—05 4612—05 0 0| 1095—01 — —*) 
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*) (39) is invalid for c = 20, т = 2, п = 12, for which « = 0-083. 


xLST 


TABLE Ш 
Rn’? (—ic, 0) and Rin’? (—ic, 0), as determined from (18) and (19) using (24), from (39) and (40), and from the eighteen-place 


ref. 9 tables 
ref. 9 (18b) (39) ref. 9 (19b) (40) 


К, 20, 0) 


ref. 9 (18b) (39) 


о 0 1-003—03 | 5:194— 08 О 4500—08 | 9454—01 9455—01  9-641—01 | 1-068— 16 0 9244—17 
1 2281—01 | 1058—01 1058—01 1-057—01 | 4642—07 0 4027—07 | 5134—02 5134—02 5133—02 
2 1802—01 | 6743—05 0 6403—05 | 8132—01 8140—01 8147—01 | 6167—13 0 5848—13 
3 4505—01 | 1230—01 1228—01 1227—01 | 4463—04 0 4253—04 | 5447—02 5447—02 5446—02 
4 1:050 00 | 1:405 —02 0 1371—02 | 6018—01 5923—01 6029—01 | 7:480-—10 0 7250—10 
5 3:216 00 | 1691—01 1683—01 1687—01 | 5638—02 0 5511—02 | 5856—02 5856—02  5855—02 
0 2001 04 | 1:182 Ol — 1-170 01 | 9:342 01 9-238 01 | 5:668 —03 0 5614—03 


1 1 2:316— 02 | 1978—06 0 8-866—01 8867—01 8857—01 | 8335—15 0 9747—15 
2 2:826—01 2860—01 | 1128—01 1128—01 1129—01 | 1:555 —05 0 1806—05 | 5279—02. 5279—02 5-280—02 
3 4802—01 4919—01 | 1:070—03 0 1149—03 | 7252—01 7270—01 7232—01] 2214—11 0 2392—11 
4 9805—01 1:010 00 | 1-380—01  1375—01 1-384—01 | 5:695— 03 0 6037—03 | 5634—02 5634—02 5635—02 
5 3122 00 3-228 00 | 8-560—02 0 8826—02 | 5093—01 — 5.093—01 | 1:625 —08 0 1707—08 
6 1289 01 1:332 01 | 2406—01 4245—01  2428—01 | 3.130—01 — 3:233—01 | 6113—02 6113—02 6115—02 
1 1-671 05 | 4604 01 — 4690 01 | 4208 02 — 4299 02 | 3-325—02 — 3-381 —02 


2.2 1232—01! 1802—01 | 3565—05 1216—01 6403—05 | 8220—01 8220—01 8147—01 | 3-166—13 0 5848—13 
3 4007—01 4505—01 | 1217—01 0 1227—01 | 2410—04 0 4253—04 | 5438—02 5438—02 5446—02 
4 9036—01 1:050 00 | 1016—02 1668—0{ 1371—02 | 6170—01 5997—01 6029—01 | 5:134—10 0 7250—10 
5 2659 00 3216 00 | 1:640—01 0 1687—01 | 4194—02 0 5511—02 | 5844—02 5844—02 5-855—02 
6 1103 01 1332 01 | 2297—01 8-222 01 2489—01 | 6109—01 — 6527—01 | 2:548 —07 0 3219—07 
7 5:604 01 6713 01 | 4182—01 0 4-509—01 | 1204 00 0 1:042 00} 6413—02 6413—02 6430—02 
12 1:345 06 1:530 06 | 1-886 02 2:088 02 | 1:933 03 2:168 03 4 1-045—01 — —*) 


*) (39) is invalid for c = 20, m = 2, п = 12, for which а = 0:083. 
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TABLE IV 


The second approximations provided by (56) and (57) in the transition neighbourhood for с = 10 


А2710, 0) 


„090—2 10, 0) Р, (—i 10, 0 К, 10, 0) 


ref, 9 (39) (56) ref. 9 (40) (57) ref. 9 (39) (56) 


1:662—01  1-659—01 1663—01 
0 0 0 
1522—01 1:532—01 1520—01 


ref. 9 (40) (57) 


0 0 0 
5$9013—01 5928—01  5-909—01 
0 0 0 


1:4405—02 1371—02  1406—02 
1691—01  1687—01  1:692—01 
2:508 —01 2489—01  2-516—01 


6018—01 6029—01 6014—01 
5:638—02 5511—02 5637—02 
6570—01 6527—01 6578—01 


5 | 1963—01 1964—01! 1962—01 0 0 8-560—02 8826—02 8-597—02 5093—01 5:098 —01 
6 0 0 0 | 4157—01 4119—01 4155—01 2406—01 2428—01 2407—01 3130—01  3-233—01 3:149 —01 
7 | 6438—02 6-303—02 6446—02 0 0 0 4712—01 4768—01 4700—01 


1:553 00 1:587 00 1:551 00 


5 0 0 | 6099—01 5928—01! 6112—01 1-640 — 01 1:636 —01 5511—02  4418—02 
6 | 1637—01 1:532—01 1624—01 0 0 0| 2297—01 2489—01 2277—01 6109—01  6527—01 6158—01 
7 0 0 | 2391—01 2218—01 2427—01 4182—01 4509—01 4121—01 


1204 00 1042 00 1195 00 
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TABLE V 


The transition value of c, at which 2,,,(—ic, 0) = 0, as determined from (43) 
and (58) and from linear interpolation of the ref. 10 values (the interpolation 
is over a с interval of 4/1 for с < 10). The values given by (43) and (58) for 


т == п = 0 are spurious 


ref. 10 (58) (43) | ref. 10 (58) (43) 


2:85 287 2-75 
3:54 3:55 3-53 
595 5:96 589 
6.69 669 668 
9-08 908 9-03 
9-83 9:83 9-82 
1219 1221 1217 
12-97 1297 12:96 


со чо t1 & шо t9 н с 
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Abstract 


The Wiener-Hopf solution of the reflection and radiation problems for 
an open-ended parallel-plane waveguide is conveniently described in 
terms of Weinstein's diffraction function. In this paper, the accuracy 
of a commonly used approximation to Weinstein's function is examined. 
Jn addition, some new series expansions for Weinstein's function are 
presented. 


1. Introduction 


During the last decades the so-called Wiener-Hopf technique has proved its 
power in the solution of a broad class of diffraction problems. A survey of the 
early literature is presented in Bouwkamp's well-known review paper (ref. 1, 
$ 7). Later contributions are reported in the books by Noble ?), and Mittra 
and Lee 3). The Wiener-Hopf technique was originally developed by Schwinger, 
Levine, Copson, Heins and others, and independently in Russia by Fock and 
Weinstein (or Vainstein). Weinstein's work of the years 1948-1950 deals with 
the reflection and radiation problems for an incident mode travelling toward 
the open end of a semi-infinite parallel-plane waveguide or circular pipe. In a 
thorough treatment, detailed analytical and numerical results are presented for 
the modal reflection coefficients of the reflected wave in the guide, and for the 
radiation pattern of the field radiated into the exterior free space. Weinstein's 
papers first became more readily available through a translation by Shmoys ^), 
a translation which was suggested by Bouwkamp. А full account of 
Weinstein’s early work is to be found in part 1 of his recent book 5). 

Consider the reflection and radiation of an incident TM or TE mode from 
the open end of a semi-infinite parallel-plane waveguide with perfectly con- 
ducting walls. Referring to Weinstein *), chapter 1, the solution to this problem 
can be expressed in terms of the auxiliary functions 


OS T 


1 c 
U*(s,p) = — flog [1 &ехр({рсвт]————4 — @) 
2л FA sin т — 5 


where —1 < s <1, p > 0, and the principal value of the logarithm is to be 
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taken. The path Г, is the steepest-descent contour Re (cos т) = 1 that passes 
through the origin; Го intersects the real axis at an angle —л/4 and has asymp- 
totes Кет = + 4x. The integrals U+(s, p) arise in the Wiener-Hopf proce- 
dure of factorization of certain analytic functions. Except for a slight change 
of notation, U*, U- are identical with the functions V, U, respectively, as 
defined in Weinstein 5), form. (10.07), (10.18). We shall call the functions 
U+(s, p) exact Weinstein functions. 
Under the transformation cos t = 1 + it?, the integral (1) reduces to 


bor: 1T iP) +410)? 


in which the logarithm and. square roots stand for principal values. The latter 
representation is not cited in Weinstein ?). Instead, Weinstein introduces what 
we shall call approximate Weinstein functions, defined by 


со 


- 1 
Ü*(s p) = —— | log I + exp Gp —p P) (3) 


di 
(Eds 


The underlying idea in this approximation is that for large p the main con- 
tribution to the integral (2) arises from the vicinity of the saddle point / = 0. 
In this vicinity all terms /? in the integrand are neglected, thus leading to (3). 
The present functions Ü *(s, p), Ü-(s, p) are identical with Weinstein's func- 
tions Vio, q), U(c, q), respectively, with с = s p!/?, q = p/2x; see Weinstein 5), 
form. (10.08), (10.19). 

Weinstein did not examine the accuracy of the approximation Ü*(s, p), nor 
did he investigate whether the approximation is uniform in s. It is the aim of 
the present paper to fill this gap. Thus it is shown in sec. 2 that 


U*(s, p) = Ü*(s, p) + O(p^?) (4) 


uniformly in s over the range —1 < s < 1. In addition we derive some further 
series expansions and approximations for the exact Weinstein functions U*(s, p) 
(sec. 3). Counterparts of these results, pertaining to the approximate functions 
Ü*(s, р), were provided in Weinstein 5), Appendix B. As an application, we 
present an improved approximation for the self-reflection coefficient of an 
incident mode near cut-off in an open-ended parallel-plane waveguide. 

Due to the simultaneous analysis of the functions U* and U-, Ü* and U-, 
double signs + or F appear at various places. Throughout this paper it is 
understood that the upper sign corresponds to U* or U+ and the lower to 
U- or U-. | 
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2. Accuracy of the approximation [7*(s, р) 


We first establish some simple properties of the Weinstein functions. It is 
obvious from (2) and (3) that 


U*(—5, p) = —U*(s, p) Ü*(—s, p) — —Ü*(s, р). (5) 


Both the exact and approximate functions are discontinuous at s = 0. Indeed, 
when applying Plemelj’s formulas to the Cauchy-type integrals (2) and (3), it is 
found that 


U*(+0, р) = U*(+0, р) = 3108 [1 + exp (ip)], 
U*(—0, p) = U*(—0, p) = — log [1 + exp (i p)]. 


Hence, the approximation Ü*(s, p) becomes exact at s — 0. 

By means of Laplace's method in a properly modified form we derive asymp- 
totic expansions for U*(s, p), Ü*(s, p), valid for large p provided that s is not 
close to zero. On expanding the integrands of (2) and (3) in Taylor series around 
t = 0, a term-by-term integration yields ` 


(6) 


14i; 
U*(s, p) — A | log [(1 + exp (ip — p t?)] dt + O(s7? р- 3/2) 


14-i S*(p) T 
= рк yp b OCP), 0 
= 1+i S*(p) 
О+(5, p) = PET тр + O(s7? р-3/2) (8) 
where 
1)" j 
S*(p) =— CF 1) oxp Um p) ; (9) 
mi? 
т= 1 


Consequently, when s is away from zero the approximation Ü*(s, p) is correct 
up to order p^?^2, 

Next, we examine the overall accuracy of Ü*(s, p) over the range —1 < s « 1. 
In virtue of the symmetry of the integrands (2) and (3), we may set 


ut ES ей pe = NETT 
epe og[ бой дБ жсти PS ) 


js? 


m Ir . s dt 
U*(s,p) = — | log [1 + exp pp: ———. an) 
л, 212—1 
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Consider now the difference 


U*(s, p) — Ü*(s, р) = Е, + E; (12) 
where 
je | s(t? — i) ; 
Е, = — [ов [1 + exp(ip—pt?)] | [0 +3212) 2/2 — 1] de, 
л 14 — 2112 — 52 
o (13) 
Е = meent 2 st? (t? —is?) gi (14) 
=— | lo ex — p t?)) ——————————. dt. 
^^ ж. н PUP PON G4 01252) 012—152) 


We establish a number of auxiliary estimates valid for —1 < s <1, p> 0, 
t>0: 
2 


t 
log [1 + exp Gp —p 22) < —log [1 — exp (—p2)], |1 + 4207)" 1^ — 1| <T 


st (t? — i) t t 


-——————— === — [—Ó——————— 


s—2i?—s| 2|?—i-iQ—s)? t?—i—i(l—s?)"? 


«2-32 |s| |П —(1—s2)2 12 + [1 + (1 — §2)1/2)- 1/2 «2-712, 


st(t? — is?) 


st(t? — i) Lisci 
(r^ —2 it? — 52) Qt? — is?) | 


1 рутата 
14 —2112 — 52 


By means of these estimates it follows that 


|E] < а [1 (=p t?)\td ! у е 
те о — exp (—p t^) tat = —— — = —, 
7 4л „ E Pus 8л p m? 48р 
mci 
л 
ы] <-> 
12р 
This proves that indeed 
U*(s, р) = Ü*(s, р) + O(p^?) (15) 


uniformly in s over the range —1 <s <1. The error involved is less than 
(52/48) p~ +, however, this estimate is not claimed to be sharp. 
As a side result of the preceding analysis we have 


joy | s(t? —i) 
U*(s,p) = — [logii + exp (p—p 7 Gt + 0070), (16) 
л { &— 21? — 52 
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again uniformly in s when —1 < s <1. Setting s = sin p, л < ọ < 47, 
we substitute 


(t? — i) sin 9 sing ` sin Фф 


t*—2it?—sin? рф 2t?—4isin?io 212 — 4ісоѕ2 4p 


then it is easily recognized that 


U*(sin p, p) = Ü*(2sin3g, р) costy + 0+(2соѕ 3, p)sindp + O(p-3), (17) 


uniformly valid for —іл < 9 < 4a. In a way the latter approximation is more 
natural than (15). Since 2 cos 3o > 2, the second function Ü* can be re- 
placed by its asymptotic expansion (8), thus leading to 


1+i 


U+(sin p, р) = 0+(2 5іп 40, p) cosg + 
4лі/2 


S*(p)p~*/? tangy + О(р-!) (18) 


uniformly in o over the range —z < 9 < ўл. 


3. Expansions for the exact Weinstein functions U*(s, p) 


The approximate functions Ü*(s, р) are comprehensively discussed in Wein- 
stein 5), Appendix B. From this reference we quote the expansion 


m spi? spi? 
U*(s, р) = 4 log [1 — exp (i 0)] + io (1 T ams) — Qus + 
1—i exp(imó) i 
+ эп spi? n» T 
m=1 
m T 
1 1 m—n— — Жу ыз: c сша == 
ü b» os( i Eos e) Ота +A 
m=1 
is pi? is pi? 
1 1 ——____—_—_ — |, 19 
ji os ( [2 Sean) T [2 asma (19) 


valid for s 2 0, p >0, p = п л + ôt where 0 < 0* < 2z and nt (n7) is 
an odd (even) integer; for simplicity the superscript + to д has been sup- 
pressed in (19). Starting from this expansion Weinstein deduces the approxi- 
mation 
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i 1-і 
pP ie Н) 


ў i л 
Ü*(s, р) = log (81/2 + 27 1/2 s p2) — ; 


in which В = —Z(4)/x*/? = 0-824 and £ = max (s? p, ô). The approximation 
(20) applies when s p!/? is small and p is close to an odd (even) multiple of л. 

In this section we shall derive counterparts of (19), (20), pertaining to the 
exact Weinstein functions U*(s, p). It is assumed that 0 < s <1, p > 0 and, 
just provisionally, p = n+ x where n* (n7) is an odd (even) integer. Consider 
the derivative dU+(s, p)/ds as obtained by differentiation of (1) with respect 
to s. The resulting integral is transformed into 


007+(5, р) _ сев exp (i p cos т) 
os 1 + exp (ip cos т) 


Sp exp (i p cos т) dr 

а саре с шй 

2л} 1 + exp (ip cos т) sint—s 
0 


through an integration by parts. Now, the first integral in (21) can be reduced to 


I у Gr 0" f exp Gimp cos v) dep) Gr 1)" Нут p) 
Ji 
т=1 Го т=1 


by calling upon a well-known integral representation for the Hankel function 
HQ. The second integral in (21) is evaluated by contour integration and 
residue calculus. To that purpose, consider the integral along the closed con- 
tour Г that consists of Г, the line Re t = —4x and a segment of the line 
Im т = —R where R is chosen such that no poles of the integrand are passed 
through. Then the contribution of the line segment tends to zero when R — со, 
whereas 


—n/2-- ico > i 
exp (i p cos т) dr | 
= F —— (4a — arcsin 5). 
1 + exp (ip cost) sint—s (1 — s?)"? 


—n/[2—ico 


Notice that the latter integral is to be understood as a Cauchy principal value 
when the lower sign applies. The contour Г encloses infinitely many poles given 
by cos т = ma/p where m is an odd (even) integer dependent on the pre- 
vailing upper (lower) sign in the integrand. А finite number of poles lies on 
the real axis between —4z and 0, the remainder is located on the negative 
imaginary axis. The residues at these poles may be evaluated in a standard 
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manner. Notice that the pole т = —4z lies on Г, hence, its residue should 
be counted half. In so doing, the ultimate result is found to be 


: 
oU s OU (Sp) 2) (F 1)" НӘ (т p) КЕС РЁ ы. (4% — arcsin s) + 


2л (1— 52)1/2 
3 Em 5 p? Q2 
B A — m? л?) [(p? — m? л?)!? + sp] ) 
тєМЁ 


in which eg = 1, & = 2 for т 5 0 апа № (N^) denotes the set of odd (even) 
integers 20. The square root (p?— m? z?)'/? is positive imaginary if 
тл > р. 

The derivative (22) is to be integrated with respect to s. Then, in virtue of 
the initial value (6), one gets the expansion 


U*(s, р) = 4 log [1 + exp (ip)] — I Br — s — (1 — s?)!? (л — arcsin )] + 
JU 


+4sp ba CF 1)" Ho? (m р) + 


+ i 2 Em E (1 + — 5 oes) oum] (23) 
(p? n Us (p? — m? m2)!/2 


valid for 0 < s < 1, p > 0, p з n* л. The latter restriction can be dropped, 
since the expansion (23) tends to a finite limit when p— nt л, as will be 
shown below. 

We now set p = п л + ôt where 0 < 6+ < 2л and n* (n7) is odd (even). 
For simplicity the superscripts + to n and ô are suppressed. Then for n > 1 
the expansion (23) can be reduced to 


U*(s, p) = 4 log [1 — exp (i 6)] — = [3x — s — (1 — 52)1? (4a — arcsin s)] + 
л 


С)" Нотр) + log( 1 + ——* 2 
iss) (CD атр) осн) 


(р? 25 п? л р? Му п? л2у\? 
т=1 
5 
niea д] 09 
T — ms (p?—m? 22)! 
meN+ 


min 


168* J. BOERSMA 


Guided by the asymptotic behaviour of the Hankel function, we write 


pi? Y (—1)"" Hom p) = 
m=1 


ў | 1—i exp (imp) 

= __түтп у] —————À— M 

T ( 1) Е Hy" (m p) лі? тї? | + 
т=1 


1—i (—1)™ exp (i m p) 
+ лі? тїї? ‚ 


т=1 


then the first series is absolutely convergent for p > 0, since its general term 
is О(т- 3/2). The second series is reduced to 


oo 


1—i exp(imó) 1—i | . 
лі? a» лї? exp (8) Ф(ехр (4 д), 3,1) 


т=1 


where Ф stands for Lerch’s transcendent, see e.g. Erdélyi et al.5), sec. 1.11. 
From the latter reference we quote the expansion 


© 


t exp (i 8) (exp (14), 3,1) = 24/2 8-12 + = >: (G7 Gy, 


лї? qi r! 


r=0 


valid for 0 < 6 < 2л where © denotes Riemann’s zeta function. 
Combining the preceding results, it follows that 


pi? Y, (—1)"" Ho®(m p) = 21/2 8-12 — (1 — i) В, + 000%?) QS) 
m=1 
where the coefficient f, is given by 


со 


В„=—л^1% Lr ++ У (ex (1 + znm?HQ?(mnza)— ax) Q6) 
m 


m=i 


For numerical purposes the Hankel function may be replaced by the first two 
terms of its asymptotic expansion, thus leading to 
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„©, HOY ыыы (27) 
п 


mi? ' 813/2 п 


ba = 


It has been checked that the relative error involved is equal to 6 . 107? n7?. 
On substitution of (25) into (24), it is easily seen that all singularities cancel 
and the expansion (24) remains finite at 6 = 0, i.e. at p = пл. In addition we 
deduce the approximation 
Uso о) 7^ EO 
ч Qp—ày2] 4 2 


in which € = тах (s? p, ô). The present approximation applies when sp!/? 
is small and p is close to n* л, nt > 1. 

The case n^ = 0 corresponding to 0 < p < 2z, needs a separate investiga- 
tion. Employing Poisson's summation formula, it can be shown that 


Pepa О. A | 1 1 
тр) = - +-log ——~——— i —— — —— 
Ў, i i р x ES 2 mx (4m? x?—p?)/2 2 тя 
т=1 m=1 


(29) 


valid for 0 < p < 2x where у is Euler’s constant. The latter result is inserted 
in the expansion (23) for U~(s, p). Then by re-expansion for small p, we obtain 
the approximation 


лі ip 4л; лі 
U~(s, p) = 3108 [р (1 + 317 2 (Ios 1—»y +5) + 
4 2л р 2 
+ (1 — 52)!? (3л — arcsin »| + О(р?). (30) 


As an application of (28), (30), we consider the reflection of an incident mode 
at the open end of a semi-infinite parallel-plane waveguide of width a. Referring 
to Weinstein 5) and Boersma ?), the self-reflection coefficient of the TM, mode 
(n = 0, 1, 2, ...) is given by 


ig, К+ ж 


4 ax? exp [2 U+(,/k, k a)] G1) 


where x, = (k? — n? л?/а?)!'? and k is the wavenumber. In (31) the upper 
sign applies for п odd and the lower for n even. Let now the incident mode 
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be near cut-off, іе. k a — n z = д is small. Then, applying (28) we deduce the 
approximation 


| 1/2 
Ru = —exp| -0—9 B 09" + (=) +00} е 
пл 


valid for n > 1, with f, given by (26) ог (27). By extending (28) up to order 
23/2, it is found that the error term in (32) is actually O(6*/). 

The case n = 0 needs a separate treatment based on (30). Thus for small 
ka we get 


ika 4л лі | 
Roo = —exp Е (108 = +1—y+ =) Toug a|, (33) 


in agreement with the results of Weinstein 5), secs 6 and 9. 

A similar approximation can be derived for the self-reflection coefficient of 
the TE,, mode (n = 1, 2, 3, . . .) near cut-off. Starting from the exact repre- 
sentation for the reflection coefficient 5:9), it is found that 


20 ү! 
йлн Е ув, Обу — (=) n oo» |, (34) 
пл 


The present approximations for R,, are to be compared with the previous 
approximate result 5:3) 


Кы œ —ехр [— (1—0 Ё Q8) 7], В=—б(@ул!'?® = 0:824, (35) 


which is based on the use of approximate Weinstein functions Ü*(s, p). 

Approximations for the self-reflection coefficient of an incident mode near 
cut-off play an important role in the ray-optical analysis of open resonators 9.10). 
It is suggested that such an analysis can be improved Бу utilizing the new 
approximate formulas (32), (34) instead of (35). 
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Abstract 


The Ericksen-Leslie director theory for liquid crystals of the nematic 
and cholesteric mesophases is extended to the smectic mesophase of 
type А. А complete dynamic theory is developed for a model having a 
layered structure determined by a family of parallel surfaces with a 
perpendicular director. The model can be extended to include the effects 
of compressibility and layer disclocations. The method used allows a 
similar treatment of smectics of type C. 


1. Introduction 


The continuum theory of liquid crystals of the nematic and cholesteric 
mesophases was developed by Ericksen +?) and Leslie +). For other presen- 
tations of the theory, see refs 5-11. The continuum theory of smectic liquid 
crystals has received some attention only recently. In the case of smectics of 
type A +”) De Gennes 1°) for a particular situation obtained results which will 
prove to be related to the general non-linear theory presented here 1^). Other 
continuum theories were presented in refs 11, 15 and 16. Most of these theories, 
however, are restricted to small deviations of a plane-layer structure. 

The smectic mesophase exhibits a layered structure. In an equilibrium con- 
figuration the layers are bounded by a family of parallel surfaces. In the case 
of the type-4 smectic mesophase the anisotropy axis or director coinciding with 
the local mean direction of the long axes of the molecules stands perpendicular 
to the surfaces. These facts are taken into account in the model. Some additional 
assumptions, however, are needed. It is assumed that the surfaces which are 
parallel in an equilibrium configuration remain parallel during motion in such 
a way that their mutual distance is time-independent. This means that Ше” 
smectic liquid crystal is supposed to be linearly incompressible or rigid in a 
direction perpendicular to the layers. If moreover three-dimensional incom- - 
pressibility is assumed for the liquid crystal, the particles situated at any one 
of the parallel surfaces constitute a two-dimensional incompressible manifold. 
As the director remains perpendicular to the surfaces in the case of type-A 
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smectics, this two-dimensional manifold behaves as a two-dimensional isotropic 
fluid. According to these assumptions deformation of the smectic liquid crystal 
can take place by bending of the parallel surfaces, by deformation at constant 
surface density of the two-dimensional fluid formed by each parallel surface, 
and by a shearing motion of the surfaces with respect to each other. 

It is possible to include the effects of compressibility and layer dislocations. 
In that case the model must be extended by allowing the surfaces to become 
non-parallel during motion. The present method employing general material 
coordinates has proved useful for the analysis of the extended model too 17), 

In sec. 2 we develop the geometry and kinematics needed for our continuum 
model of type-A smectics. Section 2.1 contains the kinematics of a directed 
medium expressed in general material coordinates, while the geometry of the 
layered structure is introduced in sec. 2.2. The dynamics is treated in sec. 3. 
In sec. 3.1 the known equations for the conservation of linear and angular 
momenta are derived in the usual way from the energy equation and the entropy 
inequality by means of invariance requirements but now in terms of the general 
material coordinates. In sec. 3.2 these results are applied to the continuum 
model and the constitutive equations are derived. Section 4 finally deals with 
a special case, viz., the statics of equilibrium configurations. 

An analysis similar to the one presented here for type-A smectics can be 
given for smectic liquid crystals of type C. 


2. Geometry and kinematics 


2.1. Kinemutics of a directed medium 


The motion of a directed medium, i.e. of a medium provided with a local 
anisotropy axis or director d of unit magnitude, can be described completely by 
giving the position vector R and the director d of each particle as a function of 
time. The particles are usually identified by their positions in some appropriately 
chosen configuration of the medium. In the case of fluids it is advantageous to 
choose for this so-called reference configuration the current configuration +°), 
ie. the configuration of the medium at the present time /. When a general 
curvilinear coordinate system (9,, #2, 9з) is introduced in the current con- 
figuration, the motion of the directed medium can be described by two func- 
tions R(,, 9,, 93, t, т) and d(9,, 9, 93, t, т). They give as a function of the 
time т the past and future values of the position vector and the director for 
each particle characterised at the present time ¢ by the coordinates (#1, 9,, Da). 
The general coordinates (2,, 9,, 9з) act as material coordinates fixed to the 
particles and thereby introduce as convected coordinates 19) a, generally dif- 
ferent, curvilinear coordinate system in each past and future configuration. 

The following short-hand notation will be used: 
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R(t) = R(9$,, 9,, 04, t, т), 
(т) = 4(д,, д, 93, t, T), 
the director having unit length 
d(z) . (т) = 1. (1) 
The translational velocity v and the director velocity u are given by 


` y(t) = E) щт) = 0 2 
dz 


In addition to the director velocity the rotational velocity of a particle is intro- 
duced. It is denoted by 


(2) 


w(t) = w(i, #„, 93, t, т). 
Here the left-hand side is again a short-hand notation for the right-hand 
member. Obviously 
u(t) = w(t) x d(T). (3) 


In each configuration assumed by the medium the convected coordinate 
system (0,, 9, 05) possesses a system of covariant base vectors defined by 


800) = R,(9, EC 
where К (т) = (0/09!) R(0,, 2, 93, t, т). It follows from (2) and (4) that 

dg, 

E = v. © 


In the sequel we shall use the short-hand notation 


f for (f(z: and 3r or f for (=) : 
А dt dt t=t 


where f stands for any scalar, vector or tensor quantity. Then e.g. from (5) 
gi = Ve (6) 
Introducing contravariant components v" by means of 
V= v g, (7) 
one has 
g = YES vl, Н (8) 
where the covariant derivative vl, is given by 


а, = v"; + Ts v. (9) 
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The Christoffel symbol Гү; is defined by 


215 = lt g- (10) 
Introducing contravariant base vectors р! by means of 
g'.g, = б) (11) 
one has 
gi = 8 g’, g =g" 8), (12) 
where 
8) = Bi + £» g” = gl . gl. (13) 


Covariant components v, are now defined by 


Y = 0,2". (14) 
Clearly Р 
Vi = Zij v, vi = g" vj. (15) 
Furthermore 
giX8; = Ek £", gixg = e"* р. (16) 


Here єк = g'/? еу and e/* = 61/2 e"* with eix and e"* representing the 
alternation symbol and g denoting det (g;)). For more details see e.g. ref. 19. 
From (8) and (11) it follows that 


gf = —v'|; g. a7) 
The element of volume dV is given by 
dV = [g; g2 ёз] dð 402 19°, 


where [g, g2 83] = £1 - (8, Хз) = 21/2 is the scalar triple product. Using 
(8) one derives that 


d 
— dV = vil, dV (18) 
dt 
and 
dg 
— = 2004,6. 19 
s eke a9 
The rate of deformation tensor 4,; and the vorticity tensor и; are defined by 
A, = + (бу + Vj) Wi = 5 Wy — 2). (20) 
If one puts 


wt = 4 gk Wkjs Q1) 
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then w = w' р; equals half the vorticity, i.e. 
w = $ curl y. 


Using (8) one can write 


dgi; 
8677 QE) = Ул-бу BV = 


= vy + Vu; == 2Aj;. 


Using (17), one obtains also 


i 
РУТ 
dt 
According to (19) 
dg 
—=24} 
a 18. 


Finally by using (3), (8) and (17) опе derives that 


dd, 
d Eigr (@! — ж!) d* + Ay d, 


аа! 
— = e (eo, — wy) d, — AY 4, 


dt 
ddi; k күл! к k| gt 
ec anus — w^) d'|, + 44, d*|; + вы |, d', 
dd'|, ikl ik ikl 
dt (0. — We) du, — AM dy, + ё coy di. 


2.2. Geometry and kinematics of the model 


175* 


Q2) 


Q3) 


(24) 


(25) 


The general coordinates of the preceding subsection are now chosen in such 
a way that they conform to the layered structure of the model. For that purpose 
the so-called normal coordinates known from the theory of shells 19,29) are 


used. They are defined by 
R = 1(9,, 05) + 9. n(0,, 9,). 


Q6) 


Here r(2,, #„) represents at the present time / one of the parallel surfaces com- 
posing the layered structure and n(#,, #„) denotes the unit vector perpendicular 
to that surface. The orientation of the normal n(@,, #„) is chosen so that 
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(91, 82, 9з) forms a right-handed coordinate system. According to (26) the 
surfaces $4 = constant constitute a family of parallel surfaces representing at 
the present time ¢ the parallel surfaces of the layered structure. For a normal 
coordinate system satisfying (26) one has 


8з = n(9,, 82), Ваз = 8з« = 0, 833 = 1. (27) 
Here and in the sequel Greek indices range over the values 1 and 2. Since the | 


director is assumed to be perpendicular to the parallel surfaces 


d= + n(0,, 92) 
or 
dos d=+41. (28) 


Without affecting the generality the plus sign in (28) is chosen. 
The first and second fundamental tensors of the parallel surfaces 94 = con- 
stant are defined respectively by 


ав = Roa Кр = Ba + Bp = ар (29) 
Bag = —R a + 0р = — &.П = —Пар. (30) 
It is easily derived that 
Газ —b Гар = bap- (31) 
The mean curvature H and the Gaussian curvature K are given by 
2H =b, 2K = Б b$ — bý bf. (32) 
Using 
bg, = By Bp, (33) 


one finds that 

Нз = 2H? — К, K3 =2 HK. 
It is possible to introduce a covariant derivative on any of the parallel surfaces 
$4 — constant. Denoting this derivative by a double vertical bar one has e.g. 

А ; 
Vaig = Va,B — Vay 
118 а,В ap "A А (3 4) 

get]. =T®,+ $i Т^ + T5, T^. 


These covariant derivatives on a surface 4 = constant are related to the 
covariant derivatives in space which were considered in sec. 2.1. For future 
purposes the following formulae are needed: 


о. = |, — bf 23, 
T*|, =T* ||, — 5; (73° + Т8) — 55 T^ + T? s, (35) 
T*!|, 85 qeu b Те Ь* T?3 + T? ,. 
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They are easily derived with the aid of (31). 

For consistency of the model two kinematical constraints were imposed in 
the introductory section. The first one was the requirement that the layered 
structure be rigid in a direction perpendicular to the layers. It can now be 
expressed mathematically by 


v? = 0. (36) 


The condition (36) guarantees that at any time v the material surfaces 9, = 
constant constitute a family of parallel surfaces. The second constraint required 
the director to remain perpendicular to the surfaces 94 = constant during 
motion. With a view to (28), its necessary and sufficient condition is given by 


dd, 
T = 0, (37) 

or equivalently, according to (25), by 
вл (0° — w^) + Aaa = 0. (38) 


Неге £j = £555. 

Since the director remains perpendicular to the surfaces 94 = constant, it 
may serve at any time т as the unit normal in the defining formula of the second 
fundamental tensor for these surfaces. Therefore 


Dap(t) = —das(1). (39) 
Using (25) one obtains 
аһ; ада], 
dí ——d 
= —в® byg (оз — w3) — Ау b — E” yis. . (40) 


Note finally that on account of (22) and (23) 


dasg 24 
dt uc afi» 
iat ae (41) 
dt /— , | 
da 
— =2 Аа 
dt 


3. Dynamics and constitutive equations 
3.1. Dynamics of a directed medium 


The equations for the conservation of mass, linear momentum and angular 
momentum may be derived from the energy equation and the entropy inequality 
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by means of invariance requirements 3:21). In the case of a directed medium 
the energy equation and the entropy inequality read respectively 


d 
xz Јену +. pmi dd) Addo + dV = 


= [or 9v L.o)dv4 [ {Т.У - M. —4)d4 (42) 
y oV 


and 


d r q 
— sdV — —dV+ | —dA z 0. 43 
с Jez т > (43) 


Here о is the mass density, и and å are the local moments of inertia per unit 
mass for rotations about axes respectively perpendicular to and along the 
director, e is the specific internal energy, F the specific body force, L the specific 
body couple, T the surface-force density, M the surface-couple density, while 
r represents the body heating per unit mass and q is the heat flux at the boundary. 
Finally s and T denote respectively the specific entropy and the absolute tem- 
perature. In the spirit of continuum mechanics both equations (42) and (43) 
are taken to be valid for any volume V occupied by the medium and bounded 
by a sufficiently smooth surface dV. In addition to the body and surface couples 
one can introduce the director body force G and the director surface force S 
given respectively by 

G-—Lxd, S=Mxd. (44) 


Applying (43) to a vanishingly small tetrahedron bounded by the three coor- 
dinate surfaces and by an arbitrarily directed surface element with normal v 


one obtains 
q = q'». (45) 


Here q denotes the heat flux for the surface element with normal v, while the 
heat flux associated with the coordinate surface 2, = constant is represented 
` by q! (gif)* 12, For details, see ref. 19. 

The energy equation should be invariant with respect to superposed rigid- 
body motions. The case of a rigid-body translation will be considered first. For 
that purpose in (42) the velocity v is replaced by v* == v + vo, where Vo is 
independent of R. It is then easily seen that the resulting equation is independ- 
ent of v, if and only if Е 


[@ + е) аи = о (46) 
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and 
peg nar [ 724. (47) 


ey 


In the derivation use is made of (3), (8), (17) and (18). Equations (46) and (47) 
expressing respectively the conservation of mass and linear momentum are 
valid for any volume V. One therefore immediately infers from (46) that 


à evil — 0. (48) 
By the tetrahedron argument one concludes from (47) that 
T=T »,. (49) 


Here T is the stress vector for a surface element with normal v, while the stress 
vector associated with a coordinate surface 9, = constant is given by 
Т! (g/^- 1/2, By virtue of (49) it follows from (47) that 


ov = QF +T',. (50) 
Written in component form, eq. (50) reads 
Qc — oF TU, (51) 


with v = c! g, and T/ = T" g, 
With the aid of (45), (48), (49) and (50) the energy equation can be reduced to 


feto. Gg ad + Add]. +} dv = 
y 


=| Esi: о). „}4/ + [M.wdd. (5% 


av 


Equation (52) being already invariant with respect to rigid-body translations 
should also be invariant under superposed rigid-body rotations. In this case 
the rotational velocity w is replaced by w* = w + wo, where wọ is independ- 
ent of R. At the same time v is replaced by v* = v + w.)x(R—R,) and 
accordingly v,; by v,;* = v, + oo Xg. Performing these substitutions in (52) 
and requiring the resulting equation to be independent of « one obtains 


Јева cian. o o —oL-- Тхе} dv = [ мал. (53) 


oV 


From this equation expressing the conservation of angular momentum it again 
follows that 
M = м” vi (54) 
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and 2 
o [u(g'gi—dd) +144]. о =gL—T/xg,+ М. (59) 
Note that М (227) 1/2 represents the couple-stress vector associated with the 
coordinate surface 9, = constant. Written in component form, eq. (55) reads 
o [u (0j — d! d) + Ad! dj] W = o L! — &"* Tj, + M", (56) 


with © = h g; and М) = МУ g, 
By using (54) and (55) the energy equation may be reduced further. Its dif- 
ferential form can be written as 


ов= er—q' + Т.у; + (Тхе). о + M/.o,, 
or equivalently, by using (20) and (21), as 
оё= or—4'|, + TV Au + в TY (0 — wh) + MY оц). (57) 
Combining (57) with the differential form of the entropy inequality, viz., 
es>e(/T) + Q/T) 4, + 4' (Т), (58) 


one obtains | 
TY Ay + вик TY (08 — w") + MY ou, (Т) Ta—0($ - 5T) 20. (59) 


Here 9 = €— Т з represents the specific free energy. The reduced entropy 
inequality (59) is the starting point for the derivation of constitutive equa- 
tions ??). 

For a homogeneous liquid crystal the specific free energy y is assumed to 
depend on the density o and the absolute temperature T, and on the director d 
and its gradient d ү. When general curvilinear coordinates are used, also the 
covariant base vector g, enters the description. Therefore 


9 = glo, 5, d, dis Т). 


This functional dependence is subject to the so-called principle of material 
objectivity 23), which requires invariance with respect to rigid-body motions. 
It follows that 


9 = Ф(0, Zij 4, du, T). (60) 


The representation is not valid for all smectic mesophases. The smectic meso- 
phase of type C requires the introduction of another director describing the 
anisotropy within the layer, while the simple dependence on о is no longer 
justified in the case of the solid-like type-B smectics. 


3.2. Dynamics and constitutive equations of the model 


The conservation equations discussed in sec. 3.1 are now applied to the 
layered structure of the model. Since the liquid crystal is assumed to be incom- 


P E 
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pressible and the motion of the layered structure is restricted by the constraint 
(36), the equation (48) for the conservation of mass assumes the form 


Ав = vell, — Б v? = 0, | (61) 
or equivalently according to (41) 
= 0 (62) 
dt 


In the derivation of (61) use has been made of (35). Equations (61) and (62) 
both express the incompressibility of the two-dimensional fluid contained within 
a surface #, = constant. With the aid of (35) the equations for the conser- 
vation of linear and angular momenta (51) and (56) can be written respectively 
as 


ос®=о Е 4. т, A b; (T3! + Т?З) E T 4- T z, 


(63) 
o сЗ — 0 ЕЗ + Ts] + bag T« — bt T3? + Тэз; 
апа 
e u h* = o L* 4- М“, — bg (M! + МӘЗ) + 
№ M? + M”? n TT. А 
B з + 87 (Tag — Тз) (64) 


о Ah = о D? + М, + bag MP + 
—b M** + M33 , — є, T”. 
Because of the constraints (36) and (38) the reduced entropy inequality (59) 
simplifies into 
T" А. + e T (o? — W°) + 2 T? Aes + 
+ М“ wag HM? was + M?* Osa + M9? 004,4 + 
Т) T, — (Т) T.s — о (ф + 51) > 0. (65) 


In the case of the model the functional dependence (60) for the specific free 
energy assumes the form i 


9 = p(o, а,в bags T). (66) 


On account of the assumption of incompressibility the functional form (66) 
may simply be written as 


P = Plaag, bag, T). (67) 


The expression (67) for the specific free energy should be invariant under 
arbitrary transformations of coordinates on a surface #3 = constant. Accord- 
ing to the theory of invariants the function (67) can be reduced to a depend- 
ence on T and on the absolute scalar invariants of a,, and 5,5, viz., the mean 
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curvature H and the Gaussian curvature К. Assuming the director to be non- 
polar one therefore obtains the following constitutive equation for the specific 
free energy: 


9 = Ф(Н?, К, T). (68) 
According to (68) the specific free energy may be considered as a function of 
bi through its invariants bg — 2H and bib; — ў b5 = 2K. Substituting 
(68) in (63) and using (37) one obtains the following form for the reduced 
entropy inequality: 
(1*9 + 0 a? (o g/ob?) b) (Aap + Eap (w° — w°)) + 
+2 T” Ags + (М°# — о E” (og[0b;)) Vais + 
+ M9 was + M?* оз, + М? w3,3 + 
— (4°/Т) Т. — (g?|T) T. — о (s + dp/T) T > 0. (69) 
From (69) constitutive equations for the stress and couple stress can be derived 
immediately. Let us suppose, for the sake of simplicity, that the dissipative parts 
of the stress and couple stress do not depend on the gradient of the rotational 
velocity. This entails non-dependence on the time rate of change of the director 
gradient ?). Assuming now a hemitropic linear dependence on the remaining 
quantities, taking into account the constraint (60) and the non-polarity of the 
director and using the Onsager reciprocity relations, one obtains 
T'E = —e (og[ob;) by — p бк + из) 4: + 
F 3 P2) дау ет? (оз =: Ws) T i ШУУ Qay e? Ts, 
2 Tas = ра) Aaa + Аа) dap E"? T, y, 
M* = в® o (og[obj), 
M? = М?%==— М?З = 0, 
qal T == —0(1) T, + Aa» Aag ef? Аз, 
93/T = —0(2) Т,з — А) (@3 — Ws); 
s = —d9/dT. 


(70) 


No constitutive equations are obtained for 7?* and T°? because of the 
kinematical constraints (38) and (36). The viscosity coefficients xu) the heat 
conductivities oq) and the coupling constants 4(,, may depend on temperature. 
As a result of the entropy inequality their values are restricted by 
Ha 2 0 (i = 1, 2, 3), 
сау 20 @ = 1,2, (71) 
Pa Skye (= 1,2) 
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If the director shows reflection symmetry the coupling constants 2q) should be 
taken equal to zero. 

The constitutive equations (70) are to be used in the conservation equations 
(63) and (64). We will consider here the equation for the conservation of 
angular momentum about an axis along the director, which is given by 


e åh; = e L3 — цо) (юз — w3) — Ао) T s. (72) 


It is not clear how one could generate a non-vanishing component L, of the 
body couple. We therefore take L4 equal to zero. Microscopic considerations, 
moreover, indicate that the local moment of inertia 2 can be taken equal to 
zero at frequencies in the hydrodynamic range. Equation (72) therefore reduces 
to 


Le (93 — из) + Ас) Т,з = 0. (23) 
Since in practice coupling constants have relatively small values, oné may expect 
Афу « Iq 


Therefore, at not too large temperature gradients in accordance with the hydro- 
dynamic low-frequency long-wavelength approximation, 


Q3 — Wa 0 (74) 


(see also ref. 24). Note that substitution of (73) in the constitutive equation 
for 43 yields 


ФТ = —(6 (2) — Ау исз) T,3 ~ —O(2y Т.з. (75) 


The discussion shows that the equation for the conservation of angular mo- 
mentum for rotations about an axis along the director can be discarded. This 
has been done from the outset in existing director theories 1—4). 

According to an argument from statistical physics 8) besides 4 also и, the 
local moment of inertia for rotations about an axis perpendicular to the 
director, may be neglected in the hydrodynamic regime. 

A comparison of theory and experiment in the case of nematics and choles- 
terics shows that for the majority of situations met in practice an expression 
for the specific free energy including only terms at most quadratic in the director 
gradient will suffice. According to this quasi-linear approximation 


Pp = Са) Н? + Cay К. (76) 


The constants Ca) and C(;, may depend on temperature and are related to 
the Frank constants 25) by 


Сау = 21, Соу = —k. (77) 
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4. Static case 


In the static case the equations for the conservation of linear and angular 
momenta (63) and (64) reduce respectively to 


o F* + T*||, — 5; T” = 0, | (78) 
о E° + Т°“||„ + beg T+ ҺР—Р„ = 0 (79) 
and 
o L* + М ||, + e Ts, = 0, (80) 
ІЗ =0. (81) 
Here use has been made of the constitutive equations 
Т° =0, M?=M**=M=0, (82) 


while Р = —T°?. Furthermore, according to p 
T'E = —o (dp/db5) b — (83) 
M* = в о (dy/db}). 
The quantities p and P are pressures acting respectively within and normal to 
a layer. Note that by virtue of (68) 
бФ® = (og[o(H?)) 4 b; dz + pK) (b; 0; — ba). (84) 


Introducing the director body force and the director stress according to (44) 
one has 


Ga = Eag D^, 88 = &, M” = —o (0/005). (85) 
An alternative form of eq. (80) is therefore 
о Ga + Sille — Tsa = 0. (86) 


The shear stress 7?* is seen to act as a quantity coupling the equations of local 
equilibrium (78), (79), (80) or (78), (79), (86). 
Henceforth body forces and body couples will be neglected. Eliminating T?« 
between (78) and (86), and using (83) and (85) one obtains 


e 10/065) 5:11 — e (oe/ob||s ba + Pa = 0. (87) 
According to (84), however, 
pdb) b; = (o ylob;) bz- (88) 


This identity expresses the invariance of y under infinitesimal coordinate trans- 
formations. Equation (87) now simplifies into 


о (0/05) bi]; + P.a = 0. (89) 
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With the aid of the Mainardi-Codazzi equations 


bius = bius (90) 


(+ +2) = 0. (91) 
0/ а ; 


one finally obtains 


Elimination of 73 between (79) and (80) gives us 
Sla + Бк S; b3 + (P— p) 5 — P, = 0. (92) 


When 52 = 0, ед. (92) reduces to the well-known Kelvin formula for the 
surface tension у — p — P. Assuming у to be zero in the interior of the liquid 
crystal — an assumption which can be proved in the compressible case — and 
using (33) and (85) one derives from (92) that 


(v + z) = pbl. (93) 
3 


Equation (91) constitutes a modification, valid for smectic liquid crystals, of a 
known result of the continuum theory of nematics and cholesterics 2) which 
states that the specific free enthalpy 9 + р/о has a constant value through the 
entire liquid crystal in an equilibrium configuration. Here the same quantity 
is only constant on each surface 4 = constant. Equation (93) gives an ex- 
pression for the gradient of the specific free enthalpy in a direction perpen- 
dicular to the Jayers. From (91) and (93) one concludes that in an equilibrium 
configuration (og/ob;)||; has to be constant on each surface 94 = constant. 
This imposes a restriction on the shape of the parallel layers. 

Let us finally specialize the specific free energy to the form (76). Since (76) 
is only valid in the quasi-inear approximation, ie. at small values of the 
director gradient, we obtain instead of (91) and (93) 


Р, = 0, (94) 
e Ca) AH +рз = 0. (95) 


Here AH denotes the second Beltrami operator or surface Laplacian of the 
mean curvature H. Since 2H = AC with С being the deviation of a surface 
#3 = constant from its local tangent plane, the dominant part of (95) is bi- 
harmonic іп £ +°). The constant Ce) only occurs in the boundary conditions. 
A. similar situation is encountered in the theory of nematics and cholesterics 
with respect to the constant k34. 
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Abstract 


Methods for tracking local plane-wave fields with complex phase have 
recently received attention. In the time-harmonic regime and in lossless 
media, such fields are frequently referred to as evanescent; they arise 
exterior to surface-wave guiding structures, on the dark side of caustics 
formed by a focused incident field, in the description of Gaussian beams, 
and in other applications. In the time-dependent regime, complex phase 
implies complex frequency and (or) wavenumber as encountered, for 
example, when Gaussian pulses are injected into a lossless or lossy 
dispersive environment. This paper summarizes current activities aimed 
at the asymptotic description of such fields by a generalization of ray- 
tracing methods developed for local plane-wave fields with real phase. 


1. Introduction 


The use of the ray-optical method for the study of propagation and diffraction 
of time-harmonic high-frequency fields is well established. By this technique, 
one first determines certain trajectories, called rays, along which local homo- 
geneous plane-wave fields propagate. The class of incident rays, descriptive of 
the incident field, is determined by prescribed conditions on an initial surface. 
Abrupt changes in the medium properties, or the presence of scattering objects, 
give rise to reflected, transmitted, or diffracted rays, which are constructed 
according to specified rules. Owing to the local character of high-frequency 
propagation, it is possible to determine the phase and amplitude of the field at 

. one point on a ray from a knowledge of the field at a preceding point on the 
same ray, and from the configuration of the tube of rays surrounding it 1). 

The ray trajectories for the local homogeneous plane-wave field and the sur- 
face whereon the initial conditions are prescribed lie in real coordinate space, 
and the phase as well as amplitude changes of the field along a ray are real 
quantities. While local homogeneous plane waves accommodate many prop- 
agation and diffraction phenomena that are of physical interest, they cannot 
account for wave processes characterized by local plane waves with complex 
phase (inhomogeneous plane waves). Such waves are required to describe prop- 
agation in non-uniform dissipative media; they are also required to describe 
evanescent waves in non-dissipative media. Evanescent waves may be en- 
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countered exterior to surface-wave guiding structures, on the dark side of 
caustics formed by a focused system of local homogeneous plane-wave fields, 
on the optically thinner side of an interface illuminated from the optically 
denser side by totally reflected fields, and in the description of Gaussian beams. 
Although it had been recognized some time ago that the ray-optical treatment 
of inhomogeneous plane waves requires the use of trajectories in a complex 
coordinate space 2), the systematic study of these "complex rays" and the 
associated fields has only very recently been seriously pursued. A principal 
motivating feature is the interest in Gaussian-beam propagation and diffraction, 
which has assumed increased importance with the development and implemen- 
tation of laser optical systems; the fundamental mode of a laser beam adheres 
to the Gaussian profile. 

The ray method has also been employed to study the evolution of time- 
dependent signals in a lossless dispersive environment. In this case, the relevant 
trajectories are space-time rays along which wave packets with real frequency 
and wavenumber propagate ?:*). When the environment is dissipative, or when 
the input signal even in a non-dissipative medium has an exponential amplitude 
dependence (for example, a Gaussian envelope on a harmonic or frequency- 
modulated carrier), the frequency and wavenumber of wave packets become 
complex and the associated trajectories proceed along complex space-time coor- 
dinates (complex space-time rays). Recent interest in complex space-time rays 
has been motivated by the need for a better description of signals propagating 
through the ionosphere from a ground-based or satellite-based transmitter; in 
the conventional treatment, which is applicable only to weak dissipation (but 
may lead to erroneous results even then for certain wave processes ?)), it is 
assumed that a real-ray path may be defined in an equivalent lossless medium 
and the effects of losses added by exponential attenuation along such a path. 

In this paper, we shall summarize some basic concepts and recent develop- 
ments that have occurred in the study of time-harmonic complex-ray fields and 
of transient fields defined along complex space-time rays. 


2. Time-harmonic problems 


2.1. Complex-source-point fields and Gaussian beams 


A particularly simple but very useful example dealing with complex rays is 
provided by the field of a point source at a complex location 6). When the 
source coordinate r' — (y', z') in the two-dimensional free-space Green's 
function 


Gr, r) = : HQXkR, | R-(»—»»*-t-G-—zYY^, (1) 


where К is the wavenumber, is assigned the complex value r' = (0, i b), b > 0, 
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then the analytically continued function С (т, r^, with Re R > 0, continues to 
satisfy the source-free wave equation and outgoing-wave condition. Since at 
large distances r >> k b?, 
_ ibz 
Rer———, r = (у? + 22)1/2, (2) 
г 


and in view of the asymptotic formula Ho ?(x) ~ (2/лх)!/? exp (ix — i л/4), x 
large, the complex-source-point field takes on its maximum value on the positive 
z-axis and decreases exponentially away from that axis. In the “paraxial” region 
у? <2? + b? near the z-axis, 

y? 

2(z—ib)' 
so that the complex-source-point field behaves there like a Gaussian beam whose 
field amplitude decreases away from the z-axis according to 


exp [— k b y? (2z? + 2b2)*1] 


Rw~z—ib+ (3) 


and whose equiphase surfaces advance along +z. The amplitude distribution 
exp (—k y?/2b) in the z = 0 plane identifies the quantity (2b/k)/? as the con- 
ventionally defined beam width. By choosing r’ = i b, where b = (b,, b;) with 
Бу > 0, one may generate a field that decays away from the b-axis, 

This property of the complex-source-point field has important consequences, 
It implies that any field solution, for which the incident field is a cylindrical 
wave, can be converted into a solution for an incident Gaussian beam by 
assigning a complex value to the source coordinate in the Green’s function 
G(r, т”), provided that the analytic continuation of G(r, r’) to G(r, r’) can be 
carried out. The ability to perform the analytic continuation generally depends 
on the particular representation employed for G(r, r^) 7). It also follows (except 
under special circumstances 8)) that by the same device, high-frequency asymp- 
totic approximations for cylindrical-wave diffraction problems, constructed by 
ray-optical methods, can be directly transformed into asymptotic solutions for 
beam diffraction. 

While the replacement of r’ by r’ yields valid field solutions, the interpretation 
of these solutions is not elucidated by the simple process of analytic continuation. 
Especially in the asymptotic high-frequency range, where the real-source-point 
field in a homogeneous medium is known to be describable (outside the “transi- 
tion regions” caused by diffraction) by local homogeneous plane waves that 
travel along straight real rays, the complex-source-point substitution provides 
no comparable explanation. This gap can be filled Бу recourse to complex 
rays °), which carry local plane-wave fields with complex phase (inhomoge- 
neous plane waves) along straight trajectories through complex coordinate 
space; these complex-ray fields become physical at the intersections of the 
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complex rays with real coordinate space. Before discussing complex rays, we 
digress to an alternative, and conventional, formulation of Gaussian-beam 
problems by plane-wave spectral decomposition. 


2.2. Plane-wave spectral synthesis of Gaussian beams 
Assuming that the field in the z = 0 plane is specified as 
u(y) = exp (—k y?°/2b), b>0, (4) 


the field u(y, z) at z > 0 is obtained on multiplying the Fourier transform of 
u(y) by spectral plane-wave functions that satisfy the source-free wave equa- 
tion, and integrating over the spectral wavenumbers 7 in the y domain: 


u(y, 2) = C f exp [ik PQy)] dn (5) 
where C is a factor including various constants and | 


Pin) —75y + х2 + ib 12/2, == a — n?) (6) 


with Im x > 0 and x = +1 at q = 0. 
For large k, the integral may be evaluated approximately by the saddle-point 
method. The saddle points 7, are solutions of dP/dy = 0, viz. 


i Ns | 
m= (2-22) w= @) 
Ь х, 
In the paraxial regime y? < 22 + b?, an approximate solution of (7) is 
ns wy(z—ibt, |1, (8) 
and 
y? 
Pn) © z + ————7 9 
тру (9) 


is then the same as the complex distance (А + ib) in (3). Thus, the complex- 
source-point field and the field generated by the initial distribution (4) have the 
same complex phase (we shall only be concerned with the exponential behavior 
of the field and not with algebraically dependent or constant factors that arise 
from different normalizations of the input conditions). 


2.3. Complex rays 


To provide interpretations of the results in secs 2.1 and 2.2, we look for local 
plane waves which, since the medium is homogeneous, carry the field from the 
source region to the observation point (y, z) along straight-line trajectories. 
Each local plane wave will be characterized by its (fixed) propagation direction 
or, equivalently, by the point of departure of its trajectory. Consider first the 
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formulation in sec. 2.2, which implies via (4) that the initial source distribution 
occupies the entire y-axis. Conventionally, one defines the initial wavenumber 
along y as the derivative of the input phase yp = i y?/2b, which is here complex: 


dy ; 

То = —— = i yolb, (10) 
. dy 

where y, denotes the initial value of the y coordinate. The trajectory, or ray, for 

the local plane wave characterized by the constant parameter 7 is then given by 


y — Уо = (folo) 2. (11) 


Since 2}, in (10) is complex for real yo, it follows that (11) cannot then be satis- 
fied at real (у, z). However, (11) can be satisfied if y and у, are allowed to have 
complex values y and jo, with z real. Then the local plane-wave field character- 
ized by a given value of 7 originates at the complex point y, in the 2 = 0 plane, 
and 


y — o = (1/х) 2, n = i yolb, (12) 


may be taken as the equation of a complex ray in (ӯ,, yi, 2) coordinate space, 
where y, = Re y and y, = Im y. The value of 7 that describes the ray 
passing through the real observation point (j,, 2) is then 


и 


and this expression provides the complex-ray interpretation of ће saddle-point 
condition (7). Thus, the complex phase for the paraxial Gaussian beam can be 
generated by inhomogeneous local plane-wave fields, which propagate with 
complex wavenumbers In| < 1 and x ~ 1 from complex points y, in the z = 0 
plane to the observation point (j,, z). Note that by the complex-ray description, 
the physically specified initial conditions in (4) are extended into complex space, 
a procedure possible only when conditions of analyticity are satisfied. 

In the alternative description of sec. 2.1, the complex phase for the paraxial 
Gaussian beam is generated by the assignment of the complex value (0, i b) to 
the coordinate of a line source. It is therefore suggestive to seek an interpretation 
in terms of inhomogeneous plane waves that travel from the complex point 
y = 0, 2' = i b to the real observation point (j,, Z,). In this description, both y 
and z must be taken as complex, and the general ray equation is 


[uU 
x(y—y)—nG—2»)-0, х= —, = 5р (14) 


where R is given by (1) when all coordinates are complex. When г is real, Ё 
yields the real-space intersections of the complex rays. 
How can the formulation in (14) be reconciled with that in (12), which is 
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based on the physically prescribed initial condition in (4)? The expression in 
(12) follows from that in (14) when 2 is constrained to be real, with y' = 0 and 
2' — i b. Then (14) becomes 


кӯ — ү (2. —іБ) = 0. (15) 
This ray intersects the 7, = 0 plane at 


yo = —i (lx) b, (16) 


which agrees in the paraxial regime x ~ 1 with (12). Thus, the constraint to the 
three-dimensional subspace (7,, Yı Z,) gives rise to an equivalent smeared-out 
source distribution in the z, = 0 plane, which is generated by a point source at 
Ӯ = 0, Z = ib in the general four-dimensional space (j,, ji, Z,, #1). 

These examples illustrate that the complex rays in (12), corresponding to the 
complex extension of a physically specified initial distribution as in (4) (with 
real z), can be more compactly expressed by an equivalent complex source point 
when all coordinates are regarded as complex. Thus, a seemingly simpler (real z) 
formulation is here actually less elegant and more cumbersome to apply than a 
formulation that involves a fully complex space. For Gaussian-beam prop- 
agation and scattering problems, this is a fortunate circumstance since the 
simple device of replacing r' by r' in ordinary Green’s-function solutions trans- 
forms these into beam solutions without the need for doing any ray tracing in 
complex space. For general initial conditions involving fields with complex 
phase, it may be necessary to resort to some type of wave-tracking procedure in 
order to construct the fields away from the initial data. While the construction 
of the complex-ray trajectories and the subsequent local plane-wave tracking 
in a fully complex space appears to provide the most general procedure 19), it is 
difficult to implement, especially in the presence of inhomogeneous media or 
scattering configurations, whose description must likewise be analytically 
extended. Moreover, the full complex space contains much extraneous informa- 
tion since only real observation points, not reached by all species of complex 
rays, are of interest in physical problems. It is therefore relevant to inquire 
whether fields with complex phase can be tracked along suitable trajectories in 


a partly complex or even entirely real space. This leads to the phase-path method 
described in the next section. 


2.4. Real-phase-path method 


When the additional constraint y — y, is imposed on the ray equation (15), 
one observes that only a single point (7,, Z,) corresponds to specified complex- 
ray parameters (7, x). Hence, (ӯ, Z,) trajectories locate the intersections of dif- 
ferent complex rays in real coordinate space. To determine such trajectories 
directly without going first into complex space, one may attempt to formulate 
tracking equations for fields with complex phase y = v, + i y, directly in real 
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(F. Z,) space. When this is done, one finds (in lossless media) that the ray equa- 
tion for fields with real phase is now replaced by two equations 11:12:13): 


d a d E | 
a, A= VB TON (17) 


where В = |V v.|, « = |V |, so and vo are unit vectors in the directions of 
V v, and V Py respectively (with vo . So = 0 in a lossless medium), and f and « 
are connected by the dispersion equation 


В? — о? = п?, (18) 


with n(r) denoting the spatially varying refractive index in the medium. Thus, 
sis the coordinate along trajectories whereon the equiphase surfaces 3p „= con- 
stant advance (on these “phase paths” y, = constant), while > is the (orthogonal) 
coordinate along paths lying in an equiphase surface. Knowing the phase paths, 
one may, from given initial conditions, calculate yy, by integration along them 
(v, being constant), and also determine the amplitude variation A(r) to synthesize 
a local inhomogeneous plane-wave field of the form A(r) exp [ik y(r)]. 

For homogeneous plane-wave fields with y, = 0, one has f = n, i.e., a known 
function. This leads to the usual real-ray equation, the first in (17), which can 
be integrated directly (either analytically or numerically). For inhomogeneous 
plane-wave fields, one has the two similar equations in (17), wherein B and о 
are, however, unknown. The evaluation of these functions and of the corre- 
sponding paths must therefore be linked in a self-consistent procedure. Since В 
and « are transported along orthogonal trajectories, implementation by sequen- 
tial tracking is much more involved than for the case yy, = 0. For weakly evanes- 
cent fields (а « В), it is possible to employ perturbation methods 11:1?) which 
may, however, be range-limited; i.e., their region of applicability may be 
restricted to some neighborhood of the surface of support of the initial condi- 
tions. Alternatively, one may seek an ansatz for the family of phase-path curves, 
motivated by known solutions for special *canonical" problems, and verify 
their validity to a desired order of approximation by substitution into (17) and 
(18). It may be noted again that determination of the phase paths constitutes 
the principal difficulty; once these paths have been found, the field calculation 
Is straightforward. 

The procedure outlined above is appealing because it tracks, in real coor- 
dinate space, the evolution of a field specified by initial data with complex 
phase. The calculation is directly concerned with the deformation of phase 
fronts, the changes in exponential amplitude profile as determined by y,, and 
changes in the algebraic amplitude Д{т); it therefore provides physical insight 
into the propagation processes that establish the field at the observation point. 
The same understanding does not follow from the complex-ray approach since 
the field reaches the observation point along a path in the physically inaccessible 
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= 


complex coordinate space. However, after the field has been found by the 
complex-ray method, one may a posteriori deduce the phase paths, etc., from 
the solution 1°), and thereby inject the physical content indirectly. 


3. Time-dependent problems 


The procedures described in sec. 2 have their direct counterpart in the time- 
dependent, dispersive regime, where the local plane-wave fields are now wave 
packets. Wave packets with real frequency w and wavenumber К are transported 
through space-time along real space-time rays; when the frequency or wave- 
number is complex, the trajectories are displaced into complex (т, 7) space. 
Since frequency at a fixed observation point is defined as w = —d$/dt, where ¢ 
is the phase of an oscillating field, a complex initial frequency profile may be 
generated by an input signal with complex phase $. The temporal counterpart 
of (4) is a Gaussian input pulse with amplitude variation exp (—t?/2b), where 
b is now a measure of the pulse width. By the same considerations as in sec. 2.2, 
one concludes from a Fourier analysis in time that the field at a distance z from 
the input plane z = 0 can be expressed by an integral as in (5), provided that К is 
replaced by unity, 7 by о, y by t, and х by the wavenumber k(w) descriptive of 
the dispersive properties of the homogeneous medium (the integration path is, 
however, chosen so as to ensure causality of the field solution). The asymptotic 
field solution can then be interpreted in terms of wave packets with complex w, 
moving along trajectories in a complex (f,, ñ, 2) space; the intersections of these 
complex rays with теа] (/,, 2) space furnish the observable time-dependent 
fields +4). 

As for time-harmonic Gaussian beams, it may be possible to generate Gauss- 
ian pulses by replacing the real source point (r', t’) in a time-dependent Green's 
function by the complex value (r', 7’). This has so far been confirmed for plane 
pulses in a lossless isotropic plasma medium !?). 

For the same reasons as mentioned in sec. 2.4, it is relevant to search for 
methods that permit the tracking of wave packets with complex frequency in 
real space-time 15:17), However, as in the time-harmonic case, the general 
applicability of such physically appealing approaches remains to be further 
explored 15). s 


4. Concluding remarks 


While only two decades ago, fundamental questions concerning the rigorous 
formulation of diffraction problems were being clarified 1°), the use of asymp- 
totic methods, systematized by ray techniques, has led to a methodology that 
makes the analytical determination of the propagation and diffraction char- 
acteristics of fields accessible even to applied scientists and engineers with little 
formal mathematical training. Ray methods for time-harmonic fields with real 
phase have found wide application in diverse areas in electromagnetics and 
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acoustics. Space-time ray techniques for real-frequency pulse propagation in 
lossless dispersive media, while of more recent origin, are also making an impact. 

Because of the demonstrated utility of real-ray techniques, it is appropriate 
to explore complex-ray techniques and their various ramifications as discussed 
in this paper. At the time of writing, several questions remain to be resolved. 
Once this has been done, one may hope to develop a methodology for complex 
rays which, though more complicated than that for real rays, will provide a 
working tool that is manageable for users without a strong background in 
applied mathematics. It will thereby become possible better to treat time- 
harmonic and time-dependent fields with complex phase, which are now finding : 
increased application in various disciplines. 
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1. An inductive way to compute the number of elements of geometries over a 
field with g elements, belonging to Lie group graphs *), will be described. In 
order to simplify the formulae, the abbreviation 


[x] = 4*—1 


is used. g = 1 is allowed (the geometries of Weyl groups); then expressions like 
[x y]/ [v] will be understood in an obvious way. 


2. Let Г be a finite incidence geometry on a graph and let o, o be different dots 
of the underlying graph (for instance Г = projective /-ѕрасе, Г(о) = set of 
sub-i-spaces, Г(о) = set of sub-j-spaces with i < j). Consider the set of in- 
cidence pairs a, b with a e Г(0), b e I'(o). Its cardinality can be computed іп 
two ways (with ао є Г(0), bo € I'(o) fixed): 


card Г(о) · card (b e Г(о) | b incident with ao} 


= card Г(о) · card (a e I'(g) | a incident with bo} 

(in the example 

number of sub-i-spaces - number of sub-j-spaces around a fixed sub-i-space 
= number of sub-j-spaces : number of sub-i-spaces within a fixed sub-j-space). 
Now by definition the b є Г incident with ао (and # ao) form an incidence 
geometry Г mod a, (in the example the sum of the geometries around and 
within ао, that is of projective i-space and projective (/—i—l)-space), and a simi- 
lar statement holds with о and о interchanged. So the ratios 


card Г(о) [ card Г(о) 


are completely determined as soon as the cardinalities in the “lower” incidence 
geometries are known, which happens to be the case in an inductive approach. 

It will be shown how beyond the ratios of the Г(о) their exact values can be 
found by suchlike inductive means. Of course, they can be easily read from 


*) See, for instance, the definition of Tits geometries in H. Freudenthal and H. de Vries, 
Linear Lie groups, Academic Press, New York, 1969, section 71.1. 
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Chevalley’s formula +) for the cardinalities of finite algebraic groups; on the 
other hand, our approach can be interpreted as another way to confirm 
Chevalley’s formula. 

The inductive approach will be first explained with A, (projective spaces), and 
D, (quadrics) though in these cases easier methods are available. 


2 


e 
Fig. 1. 


3. A, : Given a fixed point (an element of Г(1)), all other points are characterized 
by the existence of a unique joining line (an element of Г(2) incident with both 
of them). So two kinds of unique chains are possible: 


11, 
121. 


There is one chain of the first kind, and if by inductive supposition the number 
of points for A,_, is 


Uu [1] 


the number of chains of the second kind is ([/]/ [1]) ([2]/ [1] — D) = q [ДГ [1]. So 
the number of all unique chains and consequently the number of points for А; is 


card Г(1) = [/ + 1]/ [1]. 
From this it follows easily that 
card Г) = [7+ 11]: :: ( 4-2— il, [11 021: 0, 
which will be used later on. 


4. D,: In order to compute the number а; of points (elements of /'(1)) unique 
chains are considered, starting from a fixed line (element of Г(2)) and ending 


Ec 


Fig. 2. The numbering of the nodes differs from that in the earlier-quoted book. 


at a variable point. The number of these chains equals the number of points. 
There are three kinds: 


(a) 2 1. 
(b) 2 3 1. 
() 2121. 
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That is, points fall into three classes; incident with the fixed line, joined to the 
fixed line by a plane, and "general" points, connected to the given line by а 
unique line. Their cardinalities: 


(a) [2]/ [1]. 
(b) The number of all chains 2 3 1 is, by inductive supposition, a, ?[3]/ [1]; 
the non-unique ones among them arise from chains 2 1. Any chain 2 1 can be 
completed according to the pattern of fig. 3 in a,..; ways. So the number of non- 
unique chains is a;..5[2]/ [1], and the number of unique ones is 
4? 4-2. 

(c) The number of all chains 2 1 2 1 is 

((2]/ (1) а, - ((2]/ (1). 


Non-unique ones may arise from a 2 1 or from a 2 3 1 (unique). The possible 
completions of 2 1 to 2 1 2 1 are shown in fig. 4, those of 2 3 1 in fig. 5; the 
intersection of the two sub-cases of fig. 4 consists of one chain. The number of 
chains according to fig. 4 is 


(2]/ 19 (а. + 217 [1] — 1); 
the number of chains according to fig. 5 is 


q? a2 [2)/ [1]. 


ANNA «үу 


So the number of unique chains 2 1 2 1 is 
a, = (21/19 a- 27 1] — [21/11 @-1 + 217 E13 — 1) —2? а» [2]/ C0] 
=4994+ 1 4-1-G 4-2-9 +1. 
From this recursive formula it follows that 
card Г(1) = [27 — 2] [/]/ [| — 1] [1]. 
The other J'(i) are easily derived from this: 
card Г) = [2/ — 2] [27 —4]--- r—2i]- I] i — 1] -- [[— i+ 1]/ 
10—102]: ::0-— 1]: (H1; E]. fori «i— 1, 
card Га) = [21— 2] [21— 4]: · · 2. — 1] I — 2] - - Ш. 
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5. Eg: In order to compute the number of points (elements of Г(1)), unique 
chains are considered starting from a fixed and ending at a variable point. The 


w 
е 


1 2 3 4 5 
Fig. 6. The numbering of the nodes differs from that in the earlier-quoted book. 


number of these chains equals the number of points. There are three kinds: 


(a) 1 1. 
(b) 1 2 1. 
(c) 1 5 1. 


Their cardinalities: 
(a) 1. 
(b) The number of all chains 1 2 1 is (by the use of 7'(5) for D,): 
([8] [6] [4] [21/ [4] [3] [2] [1 - (237 (1), 

the non-unique ones contributing 

([8] [6] [4] [2]/ [4] [3] [2] (1), 
the remainder being 

q [8] [6] [4] [2]/ [4] [3] 2] [1]. 
(c) The number of all chains 1 5 1 is (by the use of Г(1) for Ds, and 71) for 

A4) 
(81 [5]/ [4] [LD - (18) [5]/ [4] [1]). 

The non-unique ones may arise from a 1 1 or from a 1 2 1 (unique). The 
possible completions of 1 1 and 1 2 1 (unique) to 1 5 1 are shown in figs 7 
and 8, respectively. 
The contributions are 


[8] [5]/ [4] [1] 


and 


(4 [8] [6] [4] [2]/ [4] [3] [2] 1 · (517 (17), 
respectively, the remainder of unique chains being 


4% [8] [5]/ [4] [1]. 


So the number of all points is 
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PAS 
EA b 
Fig. 7. Fig. 8. 
1 + q [8] [6] [4] [2]/ [4] [3] [2] [1] + 9° [8] [5]/ [4] [1] = [12] [9]/[4] [1]. 
From this follows the cardinality of Г(о): 
[12] [9] [8]/ [4] [3] [1]. 

In terms of the Chevalley formula this can be read. as 

card E,/card A, = [12] [9] [8] [6] [5] [21/ [6] [5] I4] [3] [2] [1]. 
For 4 = 1 this gives the number of “sixes” in the theory of the 27 lines on the 


cubic surface. 


6. E,: In order to compute the number of points (elements of 7'(1)) the unique 
chains are considered starting from a fixed element of Г(6) and ending at a 


£ 


Fig. 9. The numbering of the nodes differs from that in the earlier-quoted book. 


variable point. The number of those chains equals the number of points. There 
are three kinds: 


(a) 6 1. 
(b) 6 o 1. 
(c) 612 1. 


Their cardinalities: 


(a) [10] [6]/ [5] [1], 
by the use of Г(І) for Dg. 
(b) The number of all chains 6 о 1 is, by the use of Г(6) for Dg, and Г(1) for 
Ав: 
([10] [8] [6] [4] [21/ [5] [4] [3] [2] [1)) - (171/10). 


The non-unique ones arise from a 6 1 in the way of fig. 10. By the use of Г(5) 
for D,, their number is 
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x 


Fig. 10. 


([10] [6] / [5] [1) - 18] [61 [4] [21/14] [3] [2] [1]), 


which boils down on replacing the factor [7] / [1] in the first formula by [6]/ [1]. 
So the remainder is 


q^ [10] [8] [6] [4] [21/ [5] [4] [3] [2] [1] = (a5 [10] [6] / [5] [1)) - ([8]/ [3]. 


(c) The number of all chains 6 1 2 1 is, by the use of Г(1) for Бе, and I1) for 
Ес: 


([10] [6] [5] [1 - (12) [91/ [4] (1) - (121/1). 


The non-unique ones may arise from a 6 1, or from a6 œw 1 (unique), according 
to figs 11 and 13, respectively, the intersection between the two sub-cases of 
fig. 11 being shown in fig. 12. The contribution for fig. 11 is, by the use of I) 
for D, 


(10] [6]/ [5] (10) > ((12) [9] / [4] (1] + (181 [5]/ [41 (10) (2]/ [1D — [8] [5]/ I4] (1). 
The contribution for fig. 12 is 
(9° [10] [6]/ [5] UD ((81/ [3) - ((6]/ [1]. 
The remainder is 
[10] [6]/ [5] [1] - (g [12] [9]/ [4] [1] — [8] [5]/ [4] [1] — a5 [81 [6]/ [3] 1) 


= 4*7 [10] [6]/ [5] [1]. 
The sum of the numbers of a, b, c is 


card Г(1) = [18] [14] [10]/ [9] [5] [1], 
which in terms of Chevalley's formula can be read as 


card Ёз / card Eg = [18] [14] [12] [10] [8] [6] [2]/ [12] [9] [8] [6] [5] [2] [1]. 


AT E 


Fig. 11. | Fig. 12. Fig. 13. 
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An easy computation gives 
card Г(6) = [18] [14] [12] / [6] [4] [1] 


(for q = 1 twice the number of Steiner systems in the theory of the quartic 
curve), and 


card Г(ф) = [18] [14] [12] [10] [8]/ [1] [3] [41 [5] 17] 


(for q = 1 twice the number of Aronhold systems). 


6’. Another approach to E, is by chains from a fixed element of Г(о) to 
a variable point. The cardinalities of the sets of unique chains then are: 


wl : (7)/ 01, 
o51 : 95 {7][6]/[1][2], 
w 231: 4'2 [7] [6)/ [1] 2], 
w 6 œw 1: g?* [7]/ [1]. 


(Details of the computation omitted.) 


7. Ев: In order to compute the number of points (elements of 7'(1)), the unique 
chains are considered, starting from a fixed element of Г(7) and ending at a 


w 

e 
—— M € ———— € —____- 9 —————— € —————— ө 
1 2 3 4 5 6 7 


Fig. 14. The numbering of the nodes differs from that in the earlier-quoted book. 


variable point. The number of these chains equals the number of points. There 
are five kinds with the cardinalities 


71 г [12] [7]/ [6] [1], 

701 г q7 [12] [10] [8] [6] [4] (2]/ [6] [5] [4] (3) 21 (1) 
7231 :4!*[12] [10] [7] [6]/ I6] [5] [1] [2], 

7 67 w 1: q?? [12] [10] [8] [6] [4] [21/16] [5] [4] [3] [2] (1). 
7171 :4*5 [02] ]/ I6] [1], 


the sum of which is 
[30] [24] [20] / [10] [6] [1]. 


(Details of the computations omitted.) 


7'. Another approach to Eg is by means of chains from a fixed line (element of 
I'(2)) to a variable point: 
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21 : DUI 
231 : q? [12] [9]/ [4] [1], 


i= 
YN D : q™ [12] [9] 21/047 111, 


2121 : q?* [2]/ [1], 
2061 : 418 [12] [9] [8]/ [4] [3] [1], 


NN IN 
"d 2 \/ 1 : q?? [12] [9] [2]/ [4] 1] (11, 
qos ON 


27231 : q?? [12] [9]/ [4] [1], 
212121 : 456 [2]/ [1]. 
8. C4: 
1 2 3 
Fig. 15. 


card Г(1) = [6]/ [1], 

card Г(2) = [6] [4]/ [1] [2], 

card Г(3) — [6] [4]/ [1] [3], 
are easily found. 


9. F4: In order to compute card Г(а), unique chains are considered starting from 
a fixed element of J'(d) and ending at a variable element of 7'(a). There are three 


kinds, with cardinalities 
da : [6]/ [1], 
dbca: q* [6] [4/0] [2], 
dada: q!? [6]/ [1]: 
together 
[12] [8]/ [4] [1]. 


(Details of the computation omitted.) 


204* : . H.FREUDENTHAL 


9’. Another approach to F,, by chains from an element of Г(с) to element of 
I'(a): 


ca : [3]/ [1], 
cda : 4? BIBI I - 
caba : q$ [3] [2]/ [1] [1], 


cada :q? [31[21/[1][1], 
chdbea: q”? BIU. 


10. G4: The unique chains and cardinalities are 


21 : D 
2121 : q? [21111], 
212121: q*[2]/[1); 
together 
[6]/ [1]. 


11. Conclusion — a conjecture 


The preceding numerical results provide richer information than Chevalley's 
formula, namely the cardinalities for every kind of unique chains starting from 
a fixed element (that is for the orbits under the stability group of the fixed 
element). They strongly suggest a simple law for these cardinalities, which 
would allow one to get these numerical results (and many others) with virtually 
no computation. No doubt every attentive reader will be able to formulate this 
conjecture, and perhaps someone will be able to prove it. I am afraid I cannot 
find the leisure to look for a proof. 
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1. Introduction 


When the four authors had written their paper on perfect rectangles (Brooks 
et al. !)) they speculated on possible generalizations, and in particular on the 
theory of the dissection of equilateral triangles into equilateral triangles. 

The latter theory proved closely analogous to that for dissections of rectangles 
into squares. The rectangular theory required consideration of electrical fiows 
in networks of unit conductances. The triangular theory involved flows of some- 
thing like electricity in directed networks of unit conductances. 

In the triangular theory the direction assigned to an edge or wire has nothing 
to do with the direction of the current in it. As usual a current in a wire can be 
considered as positive in one direction, or as negative in the opposite direction, 
but of the same magnitude. The current in a wire, counted in the direction 
assigned to that wire, can be either positive or negative, or of course zero. 
Kirchhoff’s Second Law, that the algebraic sum of the currents around any 
circuit is zero, remains applicable. But his First Law is replaced by the statement 
that in the edges directed from a vertex the algebraic sum of the currents counted 
as flowing away from that vertex is zero (but not necessarily in the edges 
directed towards that vertex). As usual two vertices called the “source” and 
“sink” are exempted from the First Law. At them current is supposed to flow 
between the network and the outside world. 

In this paper we refer to our electric flows as “leaky”. "We think of the cur- 


206* R. L. BROOKS, C. A. B. SMITH, A. H. STONE AND W. T. TUTTE 


rent in a directed edge, in the direction of that edge, whether positive, negative 
or zero, as taking its full magnitude at the initial vertex of the edge, but as 
leaking away and finally diminishing to zero at the terminal vertex. 

We explain the theory of leaky flows and its application to triangulations. 
We find that a triangulation can be associated with three directed networks, 
that the sizes of its constituent triangles can be determined as the magnitudes 
of currents in these networks, and that these currents can be evaluated as de- 
terminants in terms of the structure of the networks. This was already proved 
by Tutte *), but the present paper looks at the theory from a different point 
of view. 

The three directed networks can be combined into a single bicubic map. 
The latter part of the paper relates certain matchings in this map to the span- 
ning trees of the directed graphs. In particular it is shown that the number of 
spanning trees in a directed graph, directed towards a particular vertex, is the 
same for each vertex and each of the three directed graphs. This triple tree 
number or “diplexity” is interpreted as the number of matchings of a certain 
kind in the bicubic map. 


2. Definitions 


The following definitions apply throughout unless the contrary is explicitly 
stated. Every graph is finite (possibly containing loops and/or multiple edges). 
If to each edge e, of a graph G there is assigned a real number (conductance) с 
we obtain a network Г. (If all c, = 1, we can without confusion use the symbol 
G for this unweighted network, and call it a graph.) A map Mg is the two-com- 
plex got by embedding a (planar) G in the plane (or sphere). (No two edges meet 
except at a common end vertex.) Without loss of generality we take each edge 
to be a union of straight segments. Directed graphs are indicated by the prefix 
di- (digraph, dinetwork, dimap). An edge e, from vertex v, to vertex v , may be 
written ej, or е,# if there is more than one such edge; its conductance is c, j 
or c, respectively. A tree Т in a digraph is directed towards (away from) a 
root v, when each v, in T is joined to v, by a path in T consistently directed 
respectively towards (away from) v,. A spanning tree contains all vertices of 
the graph. 


3. Complexities 
Let E be any set of edges. The conductance product of E is 
IIc [E] — IIc, (e, € E), (3.1) 


with Пс [2] = 1. The weighted complexity of a network N is the sum of con- 
ductance products of spanning trees (Smith 3)): 


Wx [N] = У Пе [T] (Ta spanning tree). (3.2) 
T 
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The (unweighted) complexity (all c, = 1) is therefore the number of spanning 
trees (Brooks et al. *)). 

In a dinetwork A with root v, we define analogously the weighted implexity 
(eplexity) with regard to this root as 


Wix (or Мех) [4, v,] = У Пс [T], (3.3) 
T 


where T varies over all spanning trees directed respectively towards (or away 
from) v,. We have 


(3.3a). The weighted complexity, implexity, eplexity are unaltered on adding 
or removing loops and edges of zero conductance, or by combining several 
edges e,,1, ..., ej" joining v, to v; into a single edge e,,* of conductance 
Cy* = сї +... -+ Cy", or by the inverse operation of splitting one edge 
е“ into several е". 


(3.3b). A weighted doublet d (in a digraph D or dinetwork A) is a pair of 
oppositely directed edges е;“, ез“ of equal conductance c, joining the same 
two vertices v, v. 


We define R,D, С.р (or R,A, C,4) as the graph (network) obtained by re- 
spectively removing or contracting d, in the obvious sense that R, simply deletes 
eij’, ед“ from the digraph, and C, in addition identifies v, with оу. We then 
have 


Wix [A, »,] = Wix [R,4, v,] + c4 Wix [C,A, v,], (3.3c) 
Wex [A, v,] = Мех [R,A, v,] + ca Мех [C,A, v,]. (3.3d) 
(3.3e). If the set of vertices is partitioned into sets 0”, U”, and there is no 
edge from a vertex of U’ to a vertex of U”, then 
v, e U' => Wex [4, ,] = 0 
and 
v, E U” = Wix [A, v,] = 0. 


In particular, the eplexity, implexity (and complexity) of a disconnected A are 
zero with regard to any root. 


(3.3f). If the set of edges of A is the disjoint union of a set of doublets, we 
can construct an undirected network Г = Und [4] by replacing each doublet 
by an undirected edge joining the same end vertices. 
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4. Balanced dinetworks 


A is balanced if for each v, the sum of the conductances of edges from v, — 
the sum of the conductances of edges to v,: 


(Vy, , X Cia” = Ў: Cpi". (4.1) 
| [27 Bou 
Note that the Л of remark (3.3f) above is balanced. 


Theorem (4.2). If Л is balanced, the implexities and eplexities with regard to 
all roots have a single common value, the (weighted) diplexity Wdx [4]. 

Tutte ^) gave a proof of this, using determinants. We present a purely topo- 
logical proof. Firstly we establish 


Lemma (4.3). If the vertices of a balanced dinetwork A are partitioned into two 
non-empty non-overlapping subsets U’, U”, then for v, e U’, v, e U”, 


Wix [4, v,] = Мех И, vs]. 


Proof. This may be taken as trivially true if the number n, of vertices < 2. 
Let т = the number of edges from U’ to U”; then if m = 0, (3.3e) implies 
(4.3). We apply induction over p = m + n,, having shown that (4.3) holds if 
p = 0. Assuming it true for p < po, take p = po, and m > 0. There is at least 
one edge е," from U’ to U”; add a second (oppositely directed) edge е", 
also with conductance c,,", and a third @,,", with conductance —c,," cancelling 
the second, thus not altering the eplexities or implexities. Here ej", ең" are 
a doublet A, hence by (3.34) and (3.3e), 


Wix И, v,] = Wix [R,4, v,] + ct Wix [С,4, »,], 
Wex [4, v,] = Мех [А,4, vs] + ci” Мех [С„4, vs]. 
But by the inductive hypothesis, the right-hand sides are equal; hence 
Wix (4, v,) = Wex (A, vj), 


completing the induction. 

Now if n, <2 (4.2) is easily verified directly. If n, > 3, and v, vj, v, are 
distinct nodes we take U’ containing v; vj, and U” containing v,, obtaining 
from (4.3) | 


Wix [A, vi] = Wix [4, vj] = Wex [4, v, ]. 


By permuting i, j, k (4.2) follows. 

Note that if A in (4.2) is balanced and connected, the total conductance of 
all edges from U’ to U” = the total from U” to U’. If all conductances are 
positive, a maximal forest is spanning and hence a spanning tree. This proves: 
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Theorem (4.4). If all conductances in a balanced connected dinetwork are posi- 
tive, its diplexity is positive. 

Note also that if 4 is balanced, and v, dus only two edges eij, €n incident 
with it, c, = Сы. We define a new balanced A’ by omitting v, replacing 
eij €, by a new edge e,,° with Caj? == Ciy = сы. Then the number of trees 
directed outwards from v, is unaffected by this modification, so that 


Мах [4’] = Жах [4]. | 4.5) 


5. Leaky electric flows | 

Let A be a balanced dinetwork with 2 or more vertices. Choose any two 
(distinct) vertices, say vo, v,, to be sink and source. Let 

€ = У с; jl. 
k 
Suppose we can assign to each vertex v, a potential V, and to each edge e, к 
a current . 
АУ = с (И, — V) (5.1) 

such that for each v, (other than vo, v,) the total current flowing along edges 


directed from v, is zero (^Kirchhoff's First Leaky Law"): 
0 = b» f= by аў Qu У) = 2; cy (, — Vj) (20,1, (5.2) 
F; . 


Jk Jk 


then the currents Г, form a leaky electric flow. 
Note that by (5.1) Z,* = 0. We set Лу = I Lf, so that 7,, = 0 (all i) and 


У, =0 (i 4 0, 1). (5.3) 
J 
1, = 2I ,; and Г = —Z Ioj are the currents flowing respectively in at the 
source and out at the sink. Since A is balanced, for each v, 
2 с) = >, Cjis | (5.4) 
J J 


so that on summing over all v, v; 


Ys = Yew (И, — Vy) 


tjJ tJ 


= D (и eu) =È cy Vy 


iJ 


= È си V,—» с V,=0. 
LJ ^n 
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By (5.3) this implies 
Ip = 1, = І (say) (5.5) 


the total current through A. Clearly if І, is a (leaky) current flow, so also is 
A I, for any constant 2. Conversely: 


Theorem (5.6). If, for every edge e,, the conductances c,/* > 0, the currents 7; 
are unique to within a constant multiplier. 

Proof. Suppose otherwise, that J, ;*’, I," are flows which are not proportional. 
Then for any a’, a", 


LE =d hf +a" hf (5.7) 
is also а not identically zero flow (with potentials V,*, say). We can choose 


a’, à" not both zero such that J)>* = 0 (and hence J,* = 0), and hence Kirch- 
hoff’s First Leaky Law holds at all vertices v,. Then 


Усу V — Vit)? = Y, (су Vj? — 2 су Vik Vj* + e V,*?) 
iJ 


ij+i 
= Ӯ; У: (cy V,? —2 ciy Vi* VŽ + ey y,*?) 
i j#t 
= X (ие у сп – У, 2 С] V;* Vy -+ 5; С) n") 
i jei J#t 1#1 i 
=>}, (re Yo -X2o Vit Vf У, си re) by (5.4) 
t J*i jii 1#1 
=2 у (r 2; cy (VF — e 
i 1#1 
= 0 by (5.2), 


implying V,* = V;* (all i,j), i.e. 7,/* = 0, a contradiction. Hence the theorem 
is established. (Similarly if all c, < 0.) 


We now give an explicit construction for a particular solution of the equa- 
tions, the full or unreduced potentials V, and currents Zj. Define a spanning 
two-tree directed to roots vo, v, as a set of edges in which from any vertex v, 
there exists either just one directed path from v, to vo, or one path from v, 
to v,, but not both. For brevity, we call it simply a “two-tree” t (with vo, v, 
understood), and we write œw[v; t] = 0 or 1 according as the path (possibly 
of zero length) from v, goes to vg or v,. 

Let 

V, = Y Пс [t] olv, t] (5.8) 
t 


(summed over all two-trees /). It follows that Vo = 0, and we call 


V,=V,—-—Vo 
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the (full) potential drop across the dinetwork A (with respect to the chosen 
o, vı). It is the implexity of v, in the dinetwork obtained from A by identi- 
fying vo, v,, and hence also the diplexity. Also, if all с; > 0, V, 7 0. 

We define the (full) current 1,} as follows. Let Т" be a typical spanning 
tree directed outwards from root v,. In Ту" there is a unique path p," from 
v, to v. Set (е, Тү") = 4-1, —1, 0 according as е, lies in p,9"p;," (case о), 
Di" Mio" (case B), or otherwise (case у), and set 


I T = È Пе (T;") Ке, Тү"). (5.9) 


Now in case a, a deletion of е,“ from Тү" gives a double tree t in which v, 
is connected to v, and v, to vo, and По(тТ") = c,;*Iic(t). But by (5.8) this 
double tree contributes IIc(t) to V; — V;, and c; Пс(ї) to J,*. In case f the 
contributions are —IIc(t) to V, — И, and —c;fIlc(f) to Lf. Other Т" 
contribute nothing to J,;*, and other ¢ contribute nothing to И; — У. Hence 
on summing all contributions, (5.1) is verified. Now take any fixed v, i 5 0, 1. 
For any tree T,” there are contributions to two J,;* only, namely when e, is 
the edge by which either р” or p," leaves v,. These two contributions are 
equal and opposite, so the contribution to PER f = 0. Hence, on summing 
over all contributions, (5.2) is verified. 

Note that 7, — Ah f = the eplexity of v, = the diplexity of A. 

Tutte ^) has given an alternative expression of these results in determinant 
form, generalizing the results of Kirchhoff ?) and Brooks et al. !). Define a 
matrix d, as follows: 


ayy = —Cy; (2 5 Ј); dy = p» Ci. (5.10) 
j 


Then the diplexity 7, of A = any first cofactor of d. The potential difference 
V, — V, = the second cofactor deleting rows 0, 1 and columns j, i. 


6. Triangulated rectangles 


Let A be plane and balanced. Tutte ^) then gave a further interpretation of 
leaky current flows as dissections of a rectangle (or parallelogram). 

Consider a rectangle R divided into triangles in the manner shown in fig. 1. 
We can construct a digraph D[R]. Each horizontal line (h4, A2, ho) in R is 
mapped into a corresponding vertex (v,, v2, vo) in D[R]. A triangle with base 
on line h, and opposite vertex on line Л, is mapped into an edge from v, to vj. 
Now imagine a current of uniform density flowing vertically downwards in R. 
This maps in the obvious way into a “leaky current flow” in D[R], such that 
the potential V, of v, = the vertical height of the line /,, the current in an 
edge = the width of the corresponding triangle in R, and the conductance of 
the edge — width/height of the triangle. The converse result, proceeding from 
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dinetwork to dissected rectangle, is established by Tutte +). In either case the 
width of the rectangle R = the total current J, in the dinetwork = its di- 
plexity, and the height of R = A; — ho = V, — Vo = total potential drop. 

From now on we restrict ourselves to dinetworks A which are plane and 
alternating, that is, such that the edges incident with any vertex vj, considered 
in clockwise order, are directed alternatively inwards and outwards. АП con- 
ductances are taken to be 1; A is then balanced. The corresponding dissection 
is into isosceles right-angled triangles; or, by a shear, it is a dissection of a 
parallelogram into equilateral triangles. This readily leads (Tutte +) to a 
theory of a dissection of an equilateral triangle A into equilateral triangles. 
For by attaching an extra equilateral triangle onto one side of A, we obtain 
a parallelogram dissected into equilateral triangles. But such a dissected triangle 
can be viewed as such in three ways (by rotating through 120°), leading to an 
idea of “triality”, which we now consider further. 


7. Bicubic graphs 


A bipartite graph G (in which the vertices are colored black and white, with no 
two of the same color adjacent) with all vertices trivalent, is bicubic. All circuits 
are even; conversely a trivalent graph with even circuits is bicubic. Each edge 
is naturally directed from black to white. Hence 


No. edges = 3x no. (black vertices) = 3 xno. (white vertices) 
= 3 B[G] (say), (7.1) 


and there are 2 B[G] vertices. If every connected component of G is bicubic, 
so is G, and conversely. No (finite) bicubic graph contains an isthmus, for 
deleting the isthmus and considering only the part containing its black end 
vertex gives the equation 
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no. (edges) = 3x no. (white vertices) = 3x no. (black vertices) — 1, 
which is impossible. 


8. Bicubic maps 


A bicubic map M is the embedding of a bicubic graph G in the plane: we 
write 8[M ] = B[G]. A trivalent map is bicubic if and only if each face has 
an even number of sides. If G has K connected components, then Euler’s 
Theorem with (7.1) shows that 


no. of faces (including exterior) = f[M] +K +1. (8.1) 


Since there are 3 [M ] edges, each incident with 2 faces, the average number 
of edges per face is 6 B[M]/(B[M] + K + 1) < 6. Hence M contains some 
face with 2 or 4 edges (digon or tetragon). 

The dual DM of a connected bicubic map M has all vertices of even valence, 
and all faces triangular and colored black and white. Each edge separates faces 
of different colors. The dual natural orientation of the edges of DM runs con- 
sistently round each face, say anti-clockwise round black faces, clockwise round 
white ones. Let л = v, v; Vp... V, be any directed path in DM. Define the 
signed length of any edge eij, ej, . . ., etc. in л to be +1 (or —1) respectively 
according as its direction agrees with the natural orientation or not; and the 
signed length of л as the sum of the signed length of its edges. The perimeter of a 
(triangular) face has signed length = + 3 = 0 (mod 3). A simple circuit has 
signed length — the sum of the signed lengths of the faces within it, and 
hence == 0 (mod 3). A non-simple circuit can be broken into simple circuits, 
hence its signed length == 0 (mod 3). Hence, all paths connecting any two ver- 
tices v, and v, have the same signed length (mod 3). Hence the vertices of DM 
fall into 3 classes, such that no two vertices of the same class are adjacent, i.e. 
there is a 3-coloring of the vertices (unique apart from permutation of the 
colors, say Green, Red, Yellow). Hence there is a 3-coloring of the faces of M, 
such that the colors G, R, Y occur in respectively anti-clockwise (clockwise) 
order around the Black (White) vertices. (А disconnected bicubic map has 
several such face colorings.) Conversely, a trivalent map with a face 3-coloring 
is bicubic. (Color a vertex black or white according as С, К, Y occur in anti- 
clockwise or clockwise order round it.) This induces a natural 3-coloring of 
the edges, giving each edge the color different from those of the 2 incident 
faces. The edges meeting at a vertex have all 3 different colors. (Other edge 
colorings may exist.) 


9. Matchings of M 


Choose a fixed black vertex v, as "root vertex" and its 3 adjacent faces as 
"root faces". Then if M is connected, K — 1, hence by (8.1) the number of 
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(non-root) black vertices = 8[M] —1 = the number of (non-root) faces. A 
pairing of each non-root face with an adjacent non-root black vertex (if such 
is possible) is a vertex—face matching. If M is disconnected, K — 1, and no 
such matching exists. The principal result of this paper is 


Theorem (9.1). The number of vertex-face matchings in a bicubic map, u[M ] is 
independent of the choice of root vertex v,. It is unaltered by interchanging 
the vertex colors, black and white. It is positive if M is connected, otherwise 
Zero. 

The proof will be given later. First we investigate other properties of bicubic 
maps. From now on we suppose all maps connected, unless stated otherwise. 
Results concerning disconnected bicubic maps usually follow straightforwardly 
from those for connected maps. 


10. Digons 


The simplest bicubic map is the trihedron, with one black and one white 
vertex joined by 3 edges. 

Let M be a bicubic map, other than a trihedron, containing a digon (a face with 
two edges e,,', e,j?, where, say, v, is black). The third edges at vertices v, vj 
will be v,v,, vjv,, where Uns Vi, vj, v, must all be distinct, and the faces are 
as shown in fig. 2a. Then 


(10.1). There is a (1 : 1) correspondence between matchings with root v, 
(in which v, must necessarily be paired with face f) and with root v, (in which 
v, must be paired with face g, other pairings being the same). 


(10.2). Let M’ be the map obtained from M by deleting vertices v;, v; and 
incident edges and inserting a new edge from v, to v, (fig. 2b). Then M' is 
bicubic, and there is a (1 : 1) correspondence between matchings with root 
v, (A vj) in M and in М”. 


Fig. 2a. Fig. 2b. 


11. Widening an edge 


Suppose that in a matching a black vertex v, is paired off with (for example) 
a green face g. Let v,v, be the green edge from v, incident with a red face r 
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and a yellow face y, and let g’ be the other face adjacent to v,. Thus we have 
the situation shown in fig. 3a (though not all vertices or all faces shown there 
are obviously distinct). However, ve must be distinct from v,. For if not, we 
would have a digon v, v; v, colored yellow, and the vertex v, would have to 
pair off with this digon, contrary to hypothesis. Similarly v, + Vm v, Æ v; 
and v,  v,. We construct a new bicubic map, M,, say, in which the vertices 
Vg, Va and incident edges are removed and two new edges v,v,, vv, put in their 
places, as shown in fig. 3b. 

The faces other than g, g’, у, ғ are unaltered, y is replaced with a new face 
You, r With a new face r,,, and g and g' are coalesced into a new face g,,. Now 
if g and g’ are distinct faces, we can construct а new matching in M, simply 
by leaving the root unchanged and pairing off each face as before (with у, 
rg, and Zyn, if non-root faces, paired with the black vertices formerly paired 
with y, r and g' respectively). Hence M,, is connected. We can show that 
g 5 g'. For suppose otherwise; then M,, would be disconnected, falling into 
connected parts М, Mon”, say with a common exterior face gj. Also the 
matching of M gives rise to a matching іп M,,, except that the face g,, is 
not matched with a black vertex: without loss of generality, we may suppose 
that the root vertex is in М. Hence the matching of M induces a matching 
in M," in which every face except the exterior is paired with a black vertex. 
But this is impossible, since there are f [M] black vertices and В [M] + 1 
interior faces. 

If g' is a root face, it follows that g,, is a root face in M,,, so that con- 
versely each matching in M,, gives rise to a matching in M. 


12. The triad of alternating dimaps 


In the bicubic map M place in each green face g, a “green vertex" g,*. If 
vwy; is a green edge, with v, incident with green face g,, and v; incident with 
green face g,, construct a new arc from g,* to g,* as the union of an arc from 
&1* to v, (inside the face g,), the edge v,vj, and an arc from v; to g,* (inside 
Ex) (see fig. 4). The new arcs can be arranged to intersect only at their end 
vertices g,*. Hence we get a new plane connected alternating dimap 4,[M] 
whose vertices are the g,* and whose edges are the arcs g,*g,* so constructed. 
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Informally it may be described as obtained from M by shrinking the green faces 
to points. Conversely, from any alternating dimap we obtain a unique bicubic 
map by “expanding the vertices into faces”. 

By replacing “green” by “red” or “yellow” we get other alternating (con- 
nected) dimaps, Д„[М], 4,[M], respectively, the three forming a triad. 


13. Rooted trees 


Consider а map M and matching џи. There will be three root faces, one of 
each color, giving rise to a “root vertex” in each of the alternating dimaps 
АМ}, ALME), 4,[M]. 

Let vy be a black vertex: suppose it paired off with a green face g,. Then 
g,*v, is the first part of a directed edge g,*g,* in A,[M ]. In this way the 
matching u gives rise to a set of directed edges in each of the triad of alter- 
nating dimaps. This set includes exactly one edge directed away from each 
vertex other than a root vertex. Furthermore, this set of directed edges cannot 
include a circuit. For suppose it did; widen in turn all the corresponding edges 
in M except one. Then we will be left with a matching of the type in fig. 4 with 
gx = Zr, Which we have shown cannot occur. Hence the set of directed edges 
in A [M] must be a forest, and there is exactly one directed edge from each 
vertex except the root. By a standard result (for finite graphs) it is a spanning 
tree directed towards the root. Thus a matching of M gives rise to three rooted 
spanning trees, one in each alternating dimap. (This result can fail for infinite 
dimaps.) 

Conversely, a rooted spanning tree in one of the alternating dimaps, say 
A, [M], corresponds to a unique matching in the (rooted) bicubic map, such that 
the G-colored root face in M corresponds to the root vertex in A,[M]. We 
establish this by a double induction. For the simplest bicubic map, the tri- 
hedron, it is trivial: there is just one tree in 4,[M] (consisting of one vertex 
and no edges) and one (empty) matching M. Now suppose that 4,[M ] has 
just one vertex (the root vertex) so that M has one green face, g, say; then 
we have to show that M has just one matching. Suppose this true for maps М 
containing <k faces, with k >3; we show it true when M contains k + 1 
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faces, one only being green. All vertices of M now lie on the boundary of g, 
and since k + 1 Z 4, there are at least 4 vertices on the boundary of g. Let 
v, be a vertex on this boundary; it must be joined by a green edge to another 
vertex v, on the boundary (fig. 5); v, and v, then divide the boundary of g 
into two segments, s,, ғ, say. There must be another vertex v, on the boundary, 
say in segment s,, and this must be joined to another vertex on s, (not s;, 
since this would make the graph non-plane). Now, either v,v, together with 
52 form a digon, or there exists another vertex v, in s2, which must be joined 
to a second vertex v,, in s2. On repeating this construction sufficiently often, 
we must find a digon composed of one boundary edge e,,“ of g in s,, and 
a green edge e,,?. There must be a similar digon somewhere оп sz. The 
interior of at least one of these digons is not a root face. By the inductive 
hypothesis, if this digon is removed as in sec. 10 above, the map has just one 
matching; hence the same holds with the digon included. This completes the 
induction for maps with a single (green) face. 

Now we apply induction over the number of green faces. Suppose the 
theorem true for maps with < u green faces, and consider one with u + 1. 
The spanning tree in A, [M] will contain at least one edge running from a 
green vertex to a green root vertex, corresponding to a green edge in M directed 
to a green root face. If this edge is widened, so that the number of green edges 
is reduced, there is by the inductive hypothesis just one matching associated 
with the directed tree; hence by the argument in sec. 10 this must hold before 
the widening. This completes the induction. : 

Thus there is a (1, 1, 1, 1) correspondence between the matchings in M (with 
any given root) and the number of directed trees (with corresponding roots) 
in A,[M], A.[M ], A,[M ]. If each edge is given unit conductance, these are 
balanced dinetworks. Hence 


Theorem (13.1). The number of matchings of M is equal to the (unweighted) . 
diplexity of any one of the triad of alternating dimaps A,[M ], 4,[M ], 4,[M ]. 


\ 
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Since the diplexity of a balanced dimap is independent of the choice of root, 
and is unaltered by reversing the direction of every edge, it follows that the 
matching number of a bicubic graph is independent of the choice of root and 
of the exchange of vertex colors black and white. This, together with theorem 
` (4.4), establishes the main theorem (9.1). 

The importance of this result is as follows. We have seen that any equilateral 
triangle divided into equilateral triangles may be thought of as a “leaky electrical 
network”; in such a way that each triangle corresponds to a wire, and the side 
of the triangle to the current in the wire. Furthermore, there is a natural ex- 
pression for this (“full”) current, either as a determinant, following Tutte y; 
or in terms of trees, by eq. (5.9) above. Using full currents, the side of the 
triangle — total current flowing — diplexity of the network. Now we can look 
at the triangle in three possible ways, by rotations through + 120°. Under 
these rotations the sizes of the geometric triangles will remain unaltered; hence 
the ratios of the “full” sides = “full currents" in the corresponding networks 
Д, А, Ay are unaltered. Theorem (13.1) implies that the actual full currents 
themselves in corresponding wires in 4,, 4,, A, are equal, and not merely their 
ratios. 


14. Reduction formulas 


We have shown that every bicubic map М contains either a digon or a 
tetragon. If it contains a digon, as in fig. 2a, we can obtain a new bicubic 
map М” eliminating the digon, as in fig. 2b. By (10.2) the matching numbers 
of M and M’ are equal, say 


Mat [M] = Mat [M"]. (14.1) 


If M contains a tetragon, but no digon, the configuration will be as in 
fig. ба. Note that v, 4 vp, for otherwise М would contain an isthmus, and 
v, Æ v,; it then follows that all the vertices shown are distinct, for otherwise 
there would be a digon. We can now construct from M two bicubic maps 
М", M" (not necessarily connected) with fewer vertices, as in fig. 6b. (This def- 
inition can readily be written out formally at length.) We have then 


Mat [М] = Mat [M'] + Mat [M"']. (14.2) 


This can be seen in two ways. In terms of the corresponding alternating di- 
graphs it is equivalent to (3.3d). Alternatively, the matchings of M divide into 
4 types: two types, in one of which v, pairs with f and v, with К, and the other 
in which v, pairs with ^ and v, with f, which correspond to matchings in M’, 
and the remaining two which correspond to matchings in M”. 

By repeated use of (14.1) and (14.2) we eventually reduce any bicubic map M 
to a set of maps made up of trihedra: the number of such maps consisting of 
a single trihedron is equal to Mat [M]. Of course, (14.2) can be applied when- 
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ever all the vertices shown in fig. ба are distinct, whether or not there is a digon 
elsewhere in the map, and the order in which the digons and tetragons are 
eliminated does not affect the final result. 

We should have forbidden the use of (14.1) and (14.2) in certain cases when 
their application would result in a map which is not bicubic according to our 
definition. In fact, it then results in one component being a hoop, that is, a face 
surrounded by a closed edge containing no vertices. Euler's Theorem does not 
apply to a hoop. If we allow such an "improper bicubic map", assigning it 
matching number | when it is connected and therefore a single hoop, and 
otherwise 0, then (14.1) and (14.2) become unrestrictedly valid. 
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1. Introduction 


An unsolved problem in the theory of convex polyhedra (and in combinatorics 
and graph theory as well) is to find a formula giving, or at least an algorithm for 
calculating, the number of combinatorially distinct polyhedra with a given num- 
ber of vertices. Grünbaum states, in the 1974 New Encyclopaedia Britannica +7), 
that “Euler was not successful in his attempts . . . to determine the number of 
types for each v. Despite efforts of many famous mathematicians since Euler... 
the problem is still open”. Shephard +7), in his discussion of the unsolved prob- 
lem, indicates that, failing a solution “it would be a considerable achievement 
to find a formula which gave a reasonably close approximation . . . when v is 
large". The problem is the same if faces are referred to instead of vertices, since 
one is the dual of the other, and enumeration by the number of edges is in the 
same category of unsolved problems. 

This article presents a compilation of results relating to the above problems. 
Table I gives the number of polyhedra with n edges, as far as known, with some 
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estimates for higher n, and some further data. Table II gives the number of 
polyhedra having a given number of faces and vertices for certain ranges of values 
and estimates for others. The sources of these data and explanations of the 
tables, and other material, are given in the form of a limited historical review. 

This is not the place to define and explain convex polyhedra, symmetry, 
duality, etc. Combinatorial equivalents here include mirror images, enantio- 
morphs; two polyhedra are isomorphic, combinatorially equivalent, if their 
vertices can be placed in one-to-one correspondence such that if two vertices are 
connected by an edge in one, the corresponding vertices in the other are also 
connected by an edge. 

For illustrative purposes, figs 1 and 2 show the 4-, 5- and 6-faced polyhedra. 
Figure | gives perspective views. Figure 2 gives corresponding projections onto 
one face from a point outside and close to the center of that face; these are 
commonly referred to as Schlegel diagrams, although such projections had 
already been used before Schlegel. 

Modern developments in the theory of polyhedra can be said to have com- 
menced with Euler’s theorem of 1752 connecting the number of vertices, faces 
and edges of a polyhedron; however, results relating to the above enumeration 
problems did not appear until the next century. In 1829, Jacob Steiner (ref. 18, 
p. 229), in Gergonne’s Annales des Mathématiques, listed the 4-, 5- and 6-faced 
polyhedra according to the number and nature of their faces and asked the 
question: “What is the general law?” Later, in his book of 1832 (ref. 18, p. 454), 
he stated that no solution had been forthcoming and rephrased the question, 
“How many different 7, 8, 9, ..., faced bodies are possible ... ?” While Steiner 
did not so state, he was concerned with combinatorially distinct convex poly- 
hedra. Again no answer was forthcoming. 

Considerable work on polyhedra appeared after 1850 but the mathematicians 
who did the most work on problems of actual enumeration were the Rev. 
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Thomas P. Kirkman (Rector of Croft-with-Southworth) and the two Germans, 
Professor Oswald Hermes (of Stieglitz) and Professor Max Brückner (of 
Bautzen). 

Attempts at finding a general formula or algorithm for enumerating the 
classes here considered failed. Kirkman, who initiated the work in a paper of 
1855 16) stated in a later paper, of 1878, “Mathematicians will never be satisfied 
until they can write down the number of P-acral Q-edra (polyhedra with P 
vertices and О faces) in terms of P and О. There is no reason to hope for that 
achievement from the present power of analysis". The same thought was echoed 
many years later by Harary and Palmer in a recent advanced work on enumera- 
tion 14), where they indicated that the enumeration of 3-connected graphs (see 
sec. 2) "evidently requires more powerful methods than now exist". 

The results of actual enumeration at the turn of the century are in the works 
of Hermes and Brückner. These include a list of polyhedra with up to 8 faces +5), 
later verified; a list and diagrams of simple polyhedra with up to 10 faces 15,6). 
later verified; a list of polyhedra with 9 faces and 9 vertices !5), later shown to 
have a duplication and two omissions, and incorrect totals for the number of 
simple polyhedra with 11 (1250) and 12 faces. These results were obtained by 
constructing the polyhedra by methods which yielded large masses of figures, 
which then needed to be compared to eliminate equivalents. Except for some 
further work by Brückner ?) in deriving totals later mainly shown to be incor- 
rect, work in this field practically ceased for a long time. The resumption is 
considered in the following sections. 


2. Planar graphs 


The diagrams in fig. 2 can be looked upon in several ways. As networks of 
points with lines connecting some pairs of them, they are "graphs" of the kind 
studied in graph theory; they are also in some connections referred to as planar 
maps. These particular diagrams are “3-connected planar" graphs. A good 
deal is packed into this term. “Planar” indicates that the graph can be drawn on 
the surface of a sphere, or in a plane, without any lines crossing each other or 
meeting, except at the vertices where they are supposed to meet; if it can be so 
drawn in any manner, it can be done with straight lines, and any planar graph 
has a dual. “Connected” in general means that the graph is in one piece. The 
term “3-connected” means that the graph cannot be divided into two parts 
which are connected to each other at less than three vertices, each of the two 
parts having at least two edges (a single edge is of course connected to the rest 
of the graph at only two vertices). This excludes, among others, figures with 
2-valent vertices or with faces having only two sides. The diagram of a cube with 
a V-shaped notch cut into one edge would have a part connected to the remain- 
der at only two vertices. It is not enough that the graph be 3-connected ; if two 
farthest removed diagonally opposite vertices of the skeleton cage of a cube are 


THE NUMBER ОЕ POLYHEDRA 223* 


joined by a line, the result, considered as a graph, is no longer planar. It is still 
3-connected but it cannot be drawn on the surface of a sphere or in a plane 
without lines crossing or meeting where they should not, and it does not 
represent a convex polyhedron. The article by Tutte 25) contains a fuller simple 
explanation of these concepts. 

The vertices and edges of any convex polyhedron form a 3-connected planar 
graph. The now famous theorem of Steinitz 29:21), as restated by Grünbaum 
(ref. 11, p. 235) is to the effect that any 3-connected planar graph can be realized 
as a convex polyhedron. Consequently graph theory becomes applicable to 
polyhedra. This theorem and its consequences seem to have been ignored for 
some time and do not appear to have been utilized until after 1950. It serves, in 
some respects, as retroactive validation of some of the things done by the early 
workers. 

The 3-connected planar graphs have turned up in a surprising way in a 
seemingly entirely unrelated area. This is in connection with the problem of 
dividing a rectangle into unequal squares. The basic paper of 1940 by Brooks, 
Smith, Stone and Tutte 5) showed, by means of an electrical analogy, that any 
such rectangle (not compounded with another rectangle) can be derived from a 
3-connected planar graph, which they then called a c-net, and these were there- 
after studied extensively with this application in view. This relationship is 
referred to by Kac and Ulam in their book *Mathematics and Logic" (Mentor 
paperback, 1969) as an illustration of “the remarkable and wholly unexpected 
connections" which occur in mathematics. That polyhedra (in effect) and plane 
rectangles divided into unequal squares, and Kirchhoff's laws relating to the 
flow of electricity in a network, are interrelated, is so intriguing that a digression 
must be made to show how this is done; not entirely a digression since it was 
this work which gave the impetus to the revival of the polyhedron enumeration 
problem. This is done by way of a specific example. 


(a) 


Ап example, the simplest which gives perfect results, is given in fig. 3, where 
(а) is the hexahedron number 9 of figs 1 and 2. The edge connecting points a 
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and b is removed and the figure is redrawn as in (b), now considered as an 
electrical network with current 7 entering at a and leaving at b and the wires 
each of unit resistance. Current directions are assigned, generally trending 
downward, but an error is immaterial since if a particular current turns out 
negative the direction and the sign can be changed simultaneously. The relative 
values of the currents can be found readily by calling that in wire fc, x, and that 
in wire df, y, and calculating the values of the others in terms of these (since the 
resistance of each wire is 1, the voltage between the ends has the same numerical 
value as the current), which leads to but one final equation, y — 7x. The result- 
ant relative values are written in in (b). The currents and the arrangement of the 
wires correspond to the 33 x 32 rectangle divided into 9 unequal squares shown 
at (c). The process can also be regarded as placing an arbitrary e.m.f., a battery, 
in edge ab of (a) and finding the relative values of the currents in the other edges, 
considered as wires with unit resistance. Thus you get a rectangle divided into 
unequal squares from a polyhedron. Placing the battery in edge eb results in a 
61 x 69 perfect rectangle. The other edges will result in the same two rectangles, 
or in an imperfect rectangle, that is, some component squares will be equal, 
because the polyhedron is self dual, and also has an axis of 180? rotational 
symmetry. 

The theory is developed in the basic paper referred to, and a popular account 
by Tutte has appeared in the Scientific American 23). A three-part paper by 
Bouwkamp amplifying and extending the previous results appeared in 
1946-47 +). 

For use in connection with the generation of rectangles divided into unequal 
squares, aiming at finding squares divided into unequal squares, sets of c-nets 
were constructed, first by hand for the lower-order ones, and then by computer. 
Bouwkamp's paper gives the diagrams of the c-nets with up to 14 edges, includ- 
ing duals (two omitted ones can be reconstructed from the last line of data on 
page 71 ?)) and Brooks et al. also had constructed these by hand. 

Since the interest was focused on the number of edges, a method was needed 
for producing all the c-nets (3-connected planar graphs) with a given number of 
edges from those with a lower number of edges. This was accomplished accord- 
ing to a theorem of Tutte developed in graph theory 23), obviously with the 
objective of its application to squared rectangles. Tutte's theorem (foreshad- 
owed, in part, by Kirkman, but without actual proof) appears in two forms. 
According to the first form, all the c-nets with n + 1 edges can be derived from 
those with n edges by performing the following two operations on the latter, in 
all possible ways: (I) adding a new edge by connecting two non-adjacent vertices _ 
of a face, and (II) splitting a vertex having at least 4 incident edges, each of the 
resultant two vertices having at least two incident edges, and connecting the two 
vertices with a new edge, and adding in the pyramid with a base of (n— 1)/2 
sides if n— 1 is even. In performing operation II, the splitting and insertion of 
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a new edge must be done in such a manner as to avoid losing the planarity of the 
graph unless non-planar graphs are also desired. The method does not produce 
pyramids, which must be fed in at the appropriate times. The second form of the 
theorem, which is limited to planar graphs, dispenses with operation ЇЇ and 
dualizes the results of operation I instead; this is the method which was pro- 
grammed for use with a computer. 

In 1960 the Dutch group, Bouwkamp, Duijvestijn and Medema, constructed 
and tabulated the c-nets with up to 19 edges, but listing only one of a dual pair. 
This was done by computer, taking nearly 11 hours on the IBM 650 at the 
Philips Research Laboratories at Eindhoven, The Netherlands. The method and 
program are given in a thesis by Duijvestijn 8); this involved solution of the 
problems of the representation of graphs for use by the computer, generating 
new graphs from old ones, generating dual graphs; and testing the large masses 
of figures to eliminate duplicates and equivalents. Their work also went on to 
generate squared rectangles from the c-nets, also by computer. The table of 
c-nets with up to 19 edges has not been published, but is available ?). 

The graphs being equivalent to convex polyhedra, considerable was added to 
the enumeration of polyhedra by the above work since now in effect all the poly- 
hedra with up to 19 edges had been constructed. And, incidentally, Hermes’ 
list of 7-hedra and 8-hedra stood checked as the computer produced the same 
results. Table II, explained in the last section of this article, which lists the num- 
ber of polyhedra in groups according to the number of faces and vertices, as far 
as known, has been constructed by utilizing the Bouwkamp et al. table up to 
19 edges, as well as other sources. 

Diagrams of the c-nets, hence polyhedra, with up to 17 edges have been 
constructed from the Bouwkamp et al. table and drawn neatly on a set of sheets, 
by Ray C. Ellis, Jr., of Massachusetts. The table was also used in ref. 9a in 
checking Hermes’ lists and in ref. 9b, and is referred to in ref. 11. 

Simple polyhedra are those with every vertex of degree three. They are the 
easiest to construct. In 1965 Grace derived and listed the simple polyhedra with 
up to eleven faces by computer, using the method of dividing faces by a line 19), 
The program took about 12 hours to run on a Burroughs B 5000 computer at 
Stanford University. His results confirmed the previous results up to 10 faces 
but his figure for 11 faces is 1249. Grace used a short-cut in testing for equiv- 

. alency, adopting a necessary condition of equal surroundedness of correspond- 
ing faces which, knowingly, was not a sufficient condition and would introduce 
errors when л was very high. A little later Bowen and Fisk generated “triangu- 
lations”, polyhedra with all faces triangular, with up to 12 vertices, by com- 
puter ^). These are the duals of the simple polyhedra with up to 12 faces and 
would be the same in number. They obtained 1249 triangulations with 11 ver- 
tices, duals of the simple polyhedra with 11 faces, agreeing with Grace, and 
7595 triangulations with 12 vertices, duals of the simple polyhedra with 12 faces, 
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disagreeing with both Hermes (7553) and Briickner (7616). The computing time 
was 14 hours on the IBM 7094 at the University of California at Berkeley. 


3. Rooted planar graphs 


The next step to be mentioned and the first step towards a general law is due 
to Tutte (ref. 24, see ref. 25 for a popular account). Tutte introduced the concept 
of a “rooted” c-net. One edge is specified as the “root”, with one end as positive 
and the other end negative, and with the two sides also distinguished, as left 
and right. On a diagram this can be indicated by an arrow on the root edge and 
a letter / and r on each side. Either side may be marked / or r giving two rooted 
graphs, and the direction of the arrow can be reversed giving two more, making 
four in all from one edge. Two rooted c-nets which would be equivalent if 
unrooted, are not equivalent unless they can be brought into coincidence with 
the rooted edges coinciding as well as their directions and the designations left 
and right. If the c-net has п edges, the number of rooted c-nets produced from 
it will be 4n. These will be distinct if the c-net has no element of symmetry. How- 
ever, if the c-net is symmetrical, the number of distinct rooted c-nets produced 
from it will be reduced, depending upon the nature of the symmetry. A single 
plane of symmetry, or a 180? rotational symmetry, will reduce the number to 
1/2; two planes of symmetry will reduce the number to 1/4; three planes of 
symmetry will reduce the number to 1/6 if they pass through one axis and to 1/8 
if they are mutually perpendicular. Higher degrees of symmetry will reduce 
the number still further. Harary and Tutte have shown that the number of 
rooted c-nets produced from a c-net with n edges is 4n/h, where Л is the order of 
the automorphism group of the particular graph +°). 

Tutte has derived an explicit formula giving the number of rooted c-nets for 
any n. This is the first breakthrough in the problem of finding a general law for 
the number of polyhedra. He also gives an associated recursion formula used 
for computations and a table of the number of rooted c-nets up to n — 25. This 
table is repeated here in table I with some added material. The significance of 
the achievement of Tutte's formula may be appreciated when it is considered 
that there is no formula which enables the number of c-nets to be calculated for 
a given л, there is also no formula for determining how many of these would be 
symmetrical, nor any for obtaining the nature of the symmetries, yet here is a 
formula for calculating a quantity dependent upon these uncalculatable el- 
ements. 

With the number of rooted c-nets for a given n known, it is still not yet possible 
to calculate the number of unrooted c-nets from it, but a fair approximation 
can be made. As pointed out by Tutte, if it be assumed that the number of sym- 
metrical c-nets becomes negligible in comparison with the number of unsymmet- 
rical ones for large enough n, an approximation of the number of unrooted 
c-nets can be obtained by dividing the number of rooted ones by 4n. This 
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TABLE I 
Number of c-nets 
1 2 3 4 5 6 ` 7 


edges rooted unrooted percent actual  symmet- percent 
estimate deficiency number rical symmetrical 


6 1 1 1 

8 4 1 1 

9 6 2 2 
10 24 2 2 
11 66 4 4 100-0 
12 214 | 12 10 83-3 
13 676 13 40:9 22 16 72-7 
14 2209 40 31:0 58 32 55:2 
15 7296 122 22:8 158 62 39-2 
16 24460 383 145 448 123 27:5 
17 82926 1220 91 1342 234 17:4 
18 284068 3946 6-0 4199 470 11-2 
19 981882 12920 3-5 13384 906 6-8 
20 3421318 42767 
21 12007554 142948 
22 42416488 482006 
23 150718770 1638248 
24 538421590 5608558 


25 1932856590 19328566 
26 6969847484 67017765 


approximation is in fact a lower limit. How well it fits is indicated in table I. 
(Tutte’s formula, when simply divided by 4, also gives an estimate of the number 
of squared rectangles with n — 1 component squares.) 

In table I the second column gives the number of rooted c-nets as given by 
Tutte up ton = 25, with an added line for п = 26. Column 3 gives the estimates, 
lower limits in fact, obtained by dividing the numbers in column 2 by 4 times 
the corresponding value for л; these then are estimates of the number of poly- 
hedra having п edges. Column 4 gives the percent deficiency of the estimate 
from the actual numbers as far as known, which are given in column 5. The 
actual number of c-nets, 3-connected planar graphs, polyhedra, by number of 
edges is known up to п = 19 from the Bouwkamp et al. table. As seen from 
column 4, the deficiency of the estimate becomes less and less with increasing 71, 
going down to 3:5 % for п = 19. Column 6 gives the number of symmetrical 
c-nets (obtained mainly from the Bouwkamp et al. table) and column 7 the 
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percentage of symmetrical ones, this percentage decreasing to 6:8 for n — 19. 
The actual number of symmetrical c-nets does not decrease with higher п, as 
seen in column 6; in fact the portion of this column from n — 13 to n — 19 is 
very close to a geometric progression with a common ratio of nearly 2. The 
unsymmetrical ones increase much more rapidly (the numbers, column 5 minus 
column 6, from n = 14 to п = 19 are very close to a geometric progression with 
а соттоп ratio of approximately 3-4), and it is this fact which causes the propor- 
tion of symmetrical c-nets to decrease. 

Tutte states that the assumption that the number of symmetrical c-nets 
becomes negligible in proportion for higher » "seems highly plausible to the 
present author, but no proof of it is known". The data in the table do not of 
course constitute a proof of the assumption, but they do increase the plausibility. 
The Tutte formula, divided by 47, is in fact a formula giving an estimate of the 
number of polyhedra with п edges. The question is whether it is not also an 
asymptotic formula. This has not been proven but appears to be highly plausible. 

That the number of rooted c-nets given by the Tutte formula agrees with the 
number obtained from the individual polyhedra by the formula 4n/h has been 
determined up to n = 17. The Bouwkamp et al. table identifies the symmetrical 
c-nets for these and the only problem was determining the order of the auto- 
morphism group for them. 


4. Number by faces and vertices 


The Tutte formula gives only the total number of rooted c-nets for a given 
number of edges. A further step has been taken by Mullin and Schellenberg +7) 
who derived a formula for calculating the number of rooted c-nets subdivided 
according to the number of faces and vertices. The formula is not reproduced 
here, as neither are the Tutte formulas, as they are quite complicated and would 
require too much explanation. The paper gives a table of the number of rooted 
c-nets with up to 16 vertices, subdivided into groups according to the number 
of faces. To the extent of this table, estimates (lower limits) can be derived for 
the number of polyhedra having a given number of faces and vertices simply by 
dividing the number in the table by four times the number of edges. Such 
estimates are given in table II of this paper. 

Table II presents a summary of the number of polyhedra by faces and vertices. 
Excluding the figures in parentheses, the numbers given in each case save one 
refer to polyhedra which have been actually derived and listed in published 
sources or in the Bouwkamp et al. table. The exception is the number for 
9 faces and 13 vertices. When a table similar to the present one was first con- 
structed, it was noticed that the column for 9 faces was complete except for one 
missing group, the second from the bottom. A way was found of separately 
generating the members of this group, which turned out to be 219, in a simple 
manner, by applying operation II of Tutte’s theorem in reverse to the individual 


THE NUMBER ОЕ POLYHEDRA 229* 


TABLE II 
Number of polyhedra 


74 176 38 14 

9 8 74 296 633 768 558 219 50 
10 5 76 633 2635 6134 (8568) (7696) (4241) 
11 38 768 6134 (25252) (63748) (103362) (111038) 
12 14 558 (8568) (63748) (266801) (706770) (1259093) 
13 219 (7696) (103362) (706770) (2932103) (8075607) 
14 50 (4241) (111038) (1259093) (8075607) (33287246) 
15 (1317) (78959) (1552824) (15521490) (94674237) 
16 233 (35653) (1334330) (21376268) (193729740) 
17 (9440) 

18 1249 
total| 12 7 34 257 2606 (31538) (435641) 


12727117 495 


members of the last group in the same column ?). These latter were 50 in num- 
ber and had been listed by Hermes in 1899 and Grace in 1965. The total number 
of 9-heara came out as 2606. Hence Steiner's question of 1832 was answered 
for 9 faces, but it has not yet been answered for his row of three dots. nor for 
his n. The 219 polyhedra with 9 faces and 13 edges have been independently 
derived by R. M. Foster of New Jersey, and exchange of results showed that 
the two sets correspond. 

The numbers in parentheses in table II are the estimates derived from the 
Mullin and Schellenberg table, except for the entry for 11 faces and 17 vertices 
which has been separately calculated in order to make this column complete. 
In general the actual numbers would not be more than a few percent greater 
than these estimates. Taking into consideration the groups whose values are 
known, the estimate (lower limit) for the total number of 10-hedra is 31538 and 
for the total number of 11-hedra it is 435641. The number of 12-hedra would 
be considerably over 5000000; extrapolation suggests about 7000000. 

The symmetry of table II about the main diagonal from upper left to lower 
right is due to duality. Groups symmetrically placed with respect to this main 
diagonal are the same in number of members, which are duals of each other. 
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The diagonals in the other direction pass through the entries having the same 
number of edges. The number at the bottom of a column, when the column is 
complete, is the number of simple polyhedra with the indicated number of 
faces; the number at the right end of a line, when the line is complete, is the 
number of polyhedra with all faces triangular. The bottom of the table gives 
totals and the number of symmetrical polyhedra. | 

The Mullin and Schellenberg table of rooted c-nets offers a means of checking 
the number of polyhedra with a given number of faces and vertices from the 
individual figures of this group if known. This is done by determining the order 
of the automorphism group for the symmetrical ones and calculating the total 
number of rooted c-nets. This has been done for the groups with up to 8 faces 
(therefore also those with up to 8 vertices) and the groups with up to 17 edges, 
and also the bottom two entries of the column for 9 faces and the bottom entry 
of the column for 10 faces. 

The Mullin and Schellenberg formula, when divided by 4 times the number 
of edges is a formula giving an estimate of the number of polyhedra having a 
given number of faces and vertices. The question arises as to whether it is not 
also an asymptotic formula. For reasons similar to those previously expressed, 
this conjecture is also quite plausible. This is shown by table III which gives the 
percentages that the estimates are of the actual numbers where these are known. 
As can be seen, the percentages increase along the main diagonal and along any 


TABLE III 


Percent closeness of formula estimate 


vertices 
11 12 
71:0 396 
92:8 90-5 
97-1 — 


total 43 104 243 531 726 89:9 
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diagonal parallel to this one. The lower ones are quite remote, as is common 
with asymptotic formulas, but the higher ones get closer, in percentages, to the 
actual numbers. s 
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Bouwkamp's thesis +) is a cornerstone in the theory of spheroidal functions, 
in their numerical treatment and their application to a specific problem which 
was of great interest at that time. As a matter of fact, he treated the diffraction 
of scalar waves through a circular aperture from long wavelengths down to such 
wavelengths where the Kirchhoff-Rayleigh approximations are already quite 
good. At that time the numerical computations were quite cumbersome. With 
the advent of the present fast computers the situation has much changed. Not 
only can one easily carry through the calculations with an increased number 
of significant digits, one can also go to much smaller wavelengths as compared 
to the radius of the aperture. 

Our goal in this note is to extend Bouwkamp's results to wavelengths which 
are one half of his, but also to compare the exact solution, which is a series in 
spheroidal wave functions, with the two Kirchhoff- Rayleigh approximations, 
also expanded in spheroidal wave functions. We restrict ourselves, however, to 
the case of a hard screen with a circular aperture. The other case of an infinitely 
soft screen with a circular aperture can be treated in a similar fashion. 

The problem to be treated is the diffraction of a scalar plane wave by a rigid 
plane with a circular aperture А. The diffracting screen S is in the plane z — 0, 
the centre of the aperture is at x == y = 0, its radius is a. The plane wave 
exp (—ikz) is impinging from the half-space z « 0 normally to the plane 2 = 0. 
The transmitted wave in z > 0 can be written in terms of spheroidal functions 
as 


PE, 7, ф = Y Car Sor(—i; iy) рѕ210; —у?) (1) 


L=0 


*) This paper is dedicated to C. J. Bouwkamp, from whose thesis the first author not only 
learned a little of the Dutch language but also got the confidence that it is worth while 
to study the spheroidal functions. 
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with 
рѕ21(1; —y?) 
disparu o 0) 
1+10,1(у) 
where О,1(у) is given by (12). The spheroidal coordinates £, 7, p are defined 


by 
х= a(&£ + 1)!? (1 — 7) cos g, 


y = a (£^ + 1)? (1 — n?) sin g, (3) 
z — aém. 
Furthermore 
y—ka-2za[A (4) 


where k is the wavenumber, 4 the wavelength. 
The boundary conditions are 


oy : 
— = 0 on S, y —] in A. (5) 
02 
The problem is Bouwkamp’s problem (1b) +). The notation of the spheroidal 
functions is taken from ref. 2. 
We also introduce the Kirchhoff-Rayleigh approximate solution 2, for 
Bouwkamp’s case K2. It satisfies the conditions 
dD K2 d K2 
id = 0 on S, is 
02 02 


= —ik in A. (6) 


It can be expanded in spheroidal functions like in (1), but the coefficients C,, 
are to be replaced by 
Ь 2L(,,2 І. 2 
Cork? = y? (4L + 1) oy”) 220 ) ) (7) 
рѕ21(1; —у?) 


where 
1 


larly’) = f 7 psor(n; —y”) dy. (8) 
о 


In the derivation of this expansion the Wronskian (ref. 2, р. 294) and (11) are 
used, 

Bouwkamp has compared the solutions y and y*? by expanding them in 
spherical wave functions and by comparing the numerical values of the coef- 
ficients for y = 5 and y = 10. Quite interesting results are obtained if, instead, 
one compares the coefficients C;; and C;,*? in the expansion (1) and in the 
corresponding expansion for y*?. This will be done for у = 10 and у = 20. 

The coefficients by?4(y?) in the expansion 


рѕ2:(7; —у?) = 2: by?! (y?) Pan), (9) 


N=0 
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where Р,м(2) = Legendre polynomial of degree 2N, with the normalization 
condition 


e 


[bn (y)? 1 
) : = ЕЯ 1; —у2) > 0 10 
АМ +1 4L + 1 рез y) up 


N=0 
have been calculated for y = 10 and y = 20. The eigenvalues of the differen- 
tial equation for рѕ,,(7; —y”) have been taken from the tables 3). Thus the 
calculation, which otherwise used Bouwkamp’s method of continued fractions, 
could be considerably simplified. These tables which also contain 


$5, (—i0; iy), j = 1, 2, 
provided a useful check of our numerical results through the two relations 
b 2L(4,2 
Sor(—i0; iy) = (71) ML d NM (11) 
рѕг1(1; —y?) 
5126—10; р) x ES | 
5190—10; 5) 2yL be? d 
Table I contains values of ps2z(0; —у?), рѕг1(1; —у?), 22 (у?) and L,(y?) 


for у = 10 and у = 20. In table II we give the values of the coefficients C2,, 
C,,*? and their contributions to the transmission coefficient, 


QuQ):— (12) 


LUN ET o4? E 1 [e dur (13) 
y! AL +1 ' y? 4L 1 


СЕТА 


For the same values of у the transmission coefficients D; and D*? аге ob- 
tained by summing these contributions over L. For the rigorous solution we 
obtain 


D,(10) = 1 — 0-002 073 135, 


(14) 
D,(20) = 1 — 0:000 618 554, 
while the asymptotic formula *) 
1 cos (2y—2/4) 7 sin (2y — 2/4) 
Dy) = 1—— + ———— ^ — 
4y? 4 gi у??? 64 zi? y"? 
1 sin (4y — л/2) 
— — + О —9/2 15 
16 yt 64 x y* QU) Ho) 
yields 
D,(10) ~ 1 — 0-002 070 475, 
(16) 


D,(20) ~ 1 — 0:000 618 531. 
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TABLE I 
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Numerical values of the spheroidal functions ps,,(0; —y2), pszz(1; —y?), the 
first term of its expansion in Legendre polynomials and the integral Г,,(у2) 


PS2z(0; —100) 


0-000 813 921 
—0:013 195 170 
0-142 454 360 
—0-395 987 554 
0-342 507 590 
—0-279 938 449 
0-245 901 337 
—0-222 983 981 
0-205 954 660 
—0-192 564 821 
0-181 634 169 


PS21(0; —400) 


0-000 000 052 
—0-000 001 774 
0-000 046 161 
—0-000 809 642 
0-009 967 985 
—0:086 201 577 
0:296 814 884 
—0-302 758 317 
0-249 059 344 
—0:220 365 401 
0-201 421 516 
—0:187 345 805 
0-176 226 112 
—0-167 095 555 
0-159 392 860 
—0-152 762 497 


рѕ21(1; —100) 


4-352 285 688 
1:816 161 791 
1:168 436 570 
0-803 293 693 
0-958 505 058 
0-986 319 157 
0-993 493 761 
0-996 477 924 
0-997 922 363 
0-998 694 079 
0-999 137 644 


рѕ21(1; —400) 


6:242 859 069 
2-712 803 726 
1:954 587 247 
1:559 665 055 


. 1:289 243 860 


1:040 623 680 
0-764 317 794 
0-903 357 521 
0-959 284 260 
0-976 307 870 
0-984 894 928 
0-989 826 232 
0-992 865 877 
0-994 836 701 
0-996 165 426 
0-997 089 973 


Ь024(100) 


‚ 0:460 356 895 
—0:223 335 757 
0-194 170 124 
—0-122 258 400 
0-017 189 123 
—0:001 055 765 
0-000 045 337 
—0-000 001 436 
0-000 000 035 
& —1.10719 
w 1.10711 


bo? (400) 


0-320 483 936 
—0-147 765 526 
0-114 468 122 
—0-100 331 086 
0-095 016 531 
— 0-098 943 970 
0-082 089 832 
—0-020 041 794 
0-002 084 752 
— 0-000 165 798 
0-000 010 568 
— 0-000 000 553 
0-000 000 024 
re —9 . 10719 
& 3.1071: 
ду —8.10713 


1.10100) 


0-411 482 831 
0-148 035 309 
0-074 325 127 
0:011 760 887 
—0-010 927 848 
—0-005 341 160 
—0-002 499 185 
—0:001 439 872 
—0-000 944 328 
—0:000 667 045 
—0-000 494 893 


Г,,(400) 


0:304 019 055 
0-124 509 834 
0-083 438 551 
0-060 619 398 
0-043 866 912 
0-028 665 777 
0-008 735 551 
—0:004 415 302 
—0-003 211 008 
—0-001 614 650 
—0-000 929 794 
—0:000 614 259 
—0-000 440 693 
—0:000 332 791 
—0:000 260 410 
—0:000 209 269 


Numerical values of the expansion coefficients С,, and С„,К? and of the partial amplitudes o5; and o; oe 


Re С,1(10) 


4-352 285 688 

— 9-080 806 226 
10-441 289 884 
— 2-810 603 098 
0-004 188 186 

— 0-000 000 170 
1.1071? 


Re С,1(20) 


6-242 859 069 
—13:564 018 632 
17:591 285 225 
—20-215 645 720 
21-917 129 240 
—21-775 712 475 
9-301 429 747 
— 0-081 299 914 
0-000 025 193 
— 0-000 000 002 
m 5.10714 
& —5. 10719 


Im С» (10) 


—0:000 002 137 
0-004 979 179 
—0-882 796 710 
4-631 536 125 
—0:261 203 400 
0-001 875 538 
—0:000 005 375 
0-000 000 008 


Im C2,(20) 


æ —1.1071* 
& 2.1071? 
—0:000 000 225 
0-000 103 700 
—0:018 944 799 
1:298 117 512 
—9:550 634 073 
1:457 132 152 
—0:028 240 605 
0-000 260 356 
—0-000 001 415 
0-000 000 005 


TABLE II 


C21*7(10) 


4-352 401 752 

— 9-102 046 386 
11-116 176 621 
— 2-326 953 993 
— 0:333 152 343 
0-012 006 169 

— 0-000 285 116 
0-000 006 017 


Ca, * Q0) 


6:242 859 074 
—13:564 019 325 
17-591 333 171 
—20-277 720 483 
21-984 170 125 
—22-894 893 665 
9-382 221 747 
1-136 306 065 

— 0-092 113 310 
0-004 058 181 

— 0-000 163 624 
0-000 006 174 


021010) 


0-378 847 814 
0-329 844 266 
0-243 999 699 
0-045 154 795 
0-000 080 288 
0-000 000 003 
ду 2.107 1^ 

ду 4.10720 


©1020) 


0-194 866 447 
0-183 982 601 
0-171 918 509 
0-158 116 081 
0:141 282 621 
0-113 301 610 
0-035 546 241 
0-000 367 214 
0-000 000 121 

ww 9.1071? 

ех 2.10716 

& 3.1072! 


2 


c;,F^(10) 


0-378 868 020 
0-331 388 994 
0-274 598 628 
0-008 330 331 
0-000 130 577 
0-000 000 137 
& 7.1071 
дч 2.10714 
c;,F?(20) 


0-194 866 447 
0-183 982 620 
0-171 919 446 
0-158 148 442 
0-142 148 158 
0-124 803 847 
0-017 605 217 
0-000 222 619 
0-000 001 286 
0-000 000 002 
£2 3.1071? 
& 4.10715 


*9Є@ 
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The transmission coefficient of the Kirchhoff approximation is 


р®2(10) = 1 — 0-006 683 313, 


(17) 
DK*(20) = 1 — 0-006 301 917. 


It is obtained by summing the individual contributions in table II, but also 
from the rigorous expression 


AQ 
реу) = 1— = , (18) 


where J,(2y) is the Bessel function of the first order. 

The calculations have been carried through on the CD 6400 computer of 
the “Rechenzentrum der Rheinisch-Westfálischen Technischen Hochschule". 
All digits given in the tables except for the last digit are considered to be safe. 

The surprising result is that D*?, although pretty close to D, in numerical 
value, has an asymptotic expression which departs from 1 by a term of order 
y^ 3/2 in contrast to (15) while, on the other hand, there is such close agreement 
between the contributions со and сок?. One expects that both can be expanded 
asymptotically in descending powers of y. The close numerical agreement at 
least to 107* for у = 10 and to 107? for у = 20 would, however, indicate, 
that the two asymptotic expansions agree in quite a number of leading terms. 
Therefore we have studied the situation by comparing the expression 


Cop’ = (—1)* (4L + 1) ps; (1; —у?) (19) 


with C,,*? for large positive values of у. The quantity C.,' is obtained from 
Cor in (2) by using 


Sar 9 (—i£; iy) = S,, C9 (—i£; iy) — i $;, ) (—i£; iy) (20) 


and noting that for ё = 0, y = 20 and L = 0,1 the imaginary part is about 
1071! times the real part or less. For y — 40 this would be true even for 
L = 0, 1,..., 6 (see ref. 3). We therefore examine the asymptotic expansion of 


Ki = eS == — ——————————— 21 
= CPG) Pb Ly E 


for large positive y and moderate values of L. 

An asymptotic expansion of the functions ps2,;(7; —y?) has been given in 
ref. 5. Although the normalization (10) drops out in (21), we give the expansions 
of the individual terms in (21) with regard to the normalized functions 
PS21(y; —y”). They are, with р = 2L + 1, 
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PS2r(1;—y?) ~ 


2y үг? p 4+1 p 
~ i ———_—_—__— 14р? + 11 Р А 22 
( ) | L— uo qa + ] (22) 


(—D* b? (y?) ~ 


2 1/2 р 8р? +3 р. 
~| ———— 1+ — 38 p? + 51 seu А 23 
(аср) | Tao ау “лу TT | e 


CD" Lily) ~ 


2 Mp 3p 8p45 
Е (xu) D P (26 p? +57) +.. | (24) 
y (4L +1) 4y 325? 128 y? 


Introducing these expansions into (21) yields 
K(y-214-0(y^9), as y — oo. (25) 


Our conjecture is that the non-occurrence of terms of the orders y~*, y~?, ут? 
is not accidental, and that all higher powers of y^! also drop out. But no general 
proof is available at the present time. This does not mean that С; and C,,*? 
are equal, rather is it expected that the О(у- *) can be replaced by O(y" exp (—y)) 
with some value of п, which may depend on L. 

Bouwkamp has also considered the other Kirchhoff-Rayleigh approximation 
K1 which has the boundary conditions 


yt —00nS$, к = 1 in A. (26) 

It can also be easily expanded in spheroidal functions 
yr = У, С S2r41 (iE; iy) PS21410; —у?) Q7) 

L=0 
with 0 < £ < œ, 0 <7 <1, where 
Б,21+ 1(y2) 1 
Cary! = by? (4L 43) — — ——À f pans s(n —y) de. (28) 
Е Р52+1(1;—у°) о 


At first sight it does not seem possible to relate simply the individual terms 
in the expansions (7) and (28). However, at large distance from the aperture, 
& > 1, we have 


SL (iE; iy) ~ i S5, €) (—i£; iy). (29) 


Moreover, for large positive values of y we have, apart from terms of order 
y" exp (—у) with some number n(Z) 
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(2—03) m (i) 

S ; MÍ|———— S ; , 

PS21410 4L 4-3 р52142] 

i 2а (С) s a B 
S $— Gre , 

: PS21411; —y*) ay 4L 4-3 o Y 


4L + 1 \ 12 
as] Ly). 


1 
ье угу ~ 3 | троа dn ~ ( 
[0] 
The square root of (4L + 1)/(4Z + 3) enters these expressions due to the 
different normalization factor of ps2, and ps;,, ,. 

Thus in the far field corresponding terms in both Kirchhoff expansions K1 
and K2 become equal to any order y^" (т = 1, 2, 3, .. .) asy — œ. But how 
fast this approach to equality is, depends on the value of L. We have chosen 

= 10 and the numerical results of Bouwkamp +) in order to calculate the 
contributions of the partial spheroidal waves to the total transmission coef- 
ficient for the case КІ. Their consecutive values are 037869; 0-32323; 0-18358; 
0:01460; 0-00009 of which four decimals are considered to be correct. These 
values should be compared with the c;,(10) and the o;,*?(10) in table II. 
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1. Introduction and summary 


A. finite region of turbulent flow is represented by a volume distribution of 
quadrupolar sources, according to the Lighthill model, for the purpose of 
estimating the sound level generated in the surrounding region. By way of 
incorporating environmental features into such a model of free turbulence it 
becomes appropriate to examine quadrupolar-source interaction with material 
surfaces; this poses a formidable diffraction or scattering problem unless the 
surface is compact on the acoustic wavelength scale or of simple geometrical 
shape. The calculations by Ffowcs Williams and Hall °), Crighton and Lep- 
pington ?) relative to an immobile point quadrupole near the straight edge of 
a thin semi-infinite plane screen, which bring out the possibility of a substantial 
amplification in the far field as the result of scattering by the screen, suggest a 
related rise for the output level of an actual flow in a comparable setting; and 
they prompt the consideration of acoustical effects attributable to the motion 
of sources past the edge. 

Point sources of fixed strength which are constrained to move rectilinearly at 
constant subsonic speed do not radiate energy; however, their passage by a 
screen or other material surface gives rise to an indirect or so-called diffraction 
radiation, whose space and time varying sources are continuously distributed 
thereupon. A particular measure of this interaction phenomenon is furnished 
by the total energy of the secondary radiation, which depends on the character 
of the source and its velocity, along with the nature of the screen and the rela- 
tive disposition of the trajectory. 

The linea1 wave equation for density fluctuations in a medium with constant 
sound speed c, 


3 


o? o? dQ oF, dTi 
Са т » o 
ot Ox; 4%; Ox; ox, 


i=1 


contains a trio of inhomogeneous or source terms, pertaining to the introduction 
of mass at the rate Q(r, t), the application of a force F(r, Г) with the com- 
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ponents F, and the action of stresses Т, (т, £), all per unit volume. For com- 
pact sources the fundamental solutions of (1) that are associated with the scalar 
and vector functions О and F represent monopole (simple) and dipole source 
fields, whereas the twice differentiated stress tensor T produces a resultant 
quadrupole field; in the context of turbulence induced sound the quadrupole 
strengths 7j, are to be identified with the corresponding Reynolds stresses and 
acquire their presumed order of magnitude through that of the eddy velocity. 

The entire energy loss of the primary source, e, is conveniently ascertained 
from the power expended in maintaining its state of uniform motion, and has 
the specific representations 


c? 
Emonopoie == — | osle, 0) Об, £) dr dt, 0) 
бо 
Saroe =} Vets 0) Fer, £) dr dt, (3) 
Equadrupote = — J Vor, t) . V . Thr, t) dr dz, (4) 


where the density and velocity potential of the secondary sources on the material 
surface present are linked by 


Qo 09x(r, t) 
(r,t) = —— — = 5 
a) o 2 (5) 


and o, is the uniform equilibrium density of the medium. 

If the surface responsible for the diffraction radiation is a rigid semi-infinite 
screen, and the point source travels in a plane which is transverse to the straight 
edge thereof with a speed indicated by the Mach number M (< 1), the radiated 
energy turns out to have the forms 


1 
monopole = —— ЕМ, Qo). (6) 
Qo d 
Edipote == В e jn Е) (7) 
апа 
1 
€quadrupole = "ny 2(М, B Qi) (8) 


where d measures the impact or minimum distance between the source trajectory 
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and the edge; explicitly, in the case of a simple source with fixed strength Qo, 


М? Qo? 


32x (1— М?)уз?? ° ©) 


(М, Qo) = 


independently of the angle or orientation of the trajectory, and for a dipole 
with moment Fy = œ F2)? in the plane of the trajectory 


FM, Е) = cos? 0 дай 1 i m 
Е  3XaLQ—M»yn? М? (бы )] ui 


where 0 denotes the angle between its axis and the direction of motion. The 
general expression for F, is lengthy, though a simple version 


TAR 1 (—4M} , 

XM, Qu) = 64л (1— M2)? Озз (11) 
obtains when a longitudinal quadrupole of strength Оз» moves in the axial 
direction. 

Two features of the radiated energy are significant, namely its universal power 
dependence on impact distance in each case and its manifest singularity when 
the source velocity equals that of sound; it is sufficient for the purpose of altering 
the former to envisage that the screen has some compliance or flexibility but a 
more thorough revision of the model is needed before the excitation due to a 
rapidly moving source can be confidently assessed. If the screen possesses an 
inertia described by a surface density of mass с a reciprocal length х = 200/с 
enters and the total energy loss from the source varies in a complicated overall 
fashion with respect to the dimensionless product x d, whose small or large 
magnitude contrasts the relative preponderance of surface or fluid mass for a 
region of scale d. 

To interpret the quadrupole output in flow terms the source strength requires 
specification for an individual energy-bearing region or single eddy; if the latter 
has the dimension (or correlation length) / and a convection velocity U, the 
assignment 


Q33 = Qo О? I? 
implies, as a consequence of (11), an order of magnitude 
Qo U* I6 
c? d? 


for the energy loss by radiation at low turbulence velocities. This corresponds 
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to an acoustical power generation of the amount 


inasmuch as //U characterizes a repetition time for successive eddy contribu- 
tions. The foregoing estimate accords with one by Ffowcs Williams and Hall, 
based on the far-field excitation of a longitudinal point quadrupole having the 
oscillation frequency U//, whose distance from the edge of a rigid half plane is 
small compared to the acoustic wavelength А = / c/U and whose axis lies per- 
pendicular to the edge. It thus appears that, in the matter of acoustical radiation 
efficiency, an immobile oscillating source near the edge of a rigid screen, where 
the flow has virtually incompressible nature, is rivalled by a slow moving eddy 
of fixed. strength which passes through the same vicinity of the edge; both 
models imply a low speed power level well above that for free turbulence, 
wherein the velocity exponent is eight rather than five. 

The theoretical analysis, which establishes previously stated results and also 
makes available information regarding the directional pattern of diffraction 
radiation, is commenced in the next section. Here the field of a simple source is 
synthesized in terms of time-periodic components, with given subsonic phase 
velocity along the trajectory and a transverse attenuation that is the more rapid 
at lesser speeds. The diffraction, by a semi-infinite screen or other material 
surface, of an arbitrary frequency representative of such inhomogeneous 
cylindrically symmetric waves thus presents the central problem of the theory, 
and after its resolution a fiequency integral leads to the secondary field from 
which the sought for aspects are directly inferred. In subsequent sections the 
relevant details are presented for dipole and quadrupole sources and, finally, 
the inhibiting effect of screen compliance on the radiation is brought out. 


2. A simple source in uniform motion past a rigid semi-infinite screen 


Adopting a system of coordinates (x,, x2, x3) wherein the screen is fixed and 
occupies the half-plane x, > 0, —oo < x, « oo, хз = 0, the point source func- 
tion 

Q(r, t) = Q(x, X2, хз, t) 
= Qo б(х, + dcosec à — Ut cos 9) d(x) 6(x3— Ut sin 9) (12) 


corresponds to motion at speed {7 along a rectilinear trajectory which lies in a 
plane normal to the edge of the screen; its inclination to and minimum separa- 
tion from the screen being given by the angle 9 (0 < 9 < z/2) and the dis- 
tance d, respectively. In accordance with the relation for irrotational motions 


Anon (13) 
с or 
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the velocity potential g(r, £) satisfies an inhomogeneous wave equation (cf. (1)) 


3 


d? 1 d? 
(У 5-55 «60 - 


= Qo 6(x, + d cosec à — Ut cos 9) 6(x2) 0(x4 — Ut sin 3) (14) 


0o 


and the particular solution that characterizes the isolated or primary source 
admits the representations 


Prt, і) = 


1 | exp [ у(х, — х.) + i2 (xa — x2’) + зх — х3) — io(t — t LN 


^ Qz* . E? + C5? + 06? — (w/c)? 
X Olx’, x2, xs', t") dx,' іх, ах» dt’ dc, dl. абз dw 
Qo sin à 
Ол)? o, U 


exp (i5, x, + = dcosec® + 16, х, + i[(w—l, Ucos 9)/ Usin 9] этш, 
х i 
| t — (0/0) cos 9? + sin? 9 (£2? + y?) 


x dt, dt,dew, (15) 


x oW? w\? " 
Ga. a 


'The causal nature of the preceding wave function, reflected in source contribu- 
tions subsequent to the time of emission, is assured when c has an infinitesimal 
positive imaginary part. 

To account for the presence of the screen, a secondary wave function of the 
form 


or, t) = 


1 fs exp [it (x1 — x4") + ib2(xa — х2) + ibs(xs — x3') — io(t— t 31 
NU 6,2 + 65? + 32 — (w/c)? 


X f Ga, хо, t") д'(хз') dx,” dx; dx’ dr' dé, 46 dí do (17) 
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is introduced; this involves a source factor 6’(x3) which refers to the plane of 
the screen and implies a discontinuity in the velocity potential there which is 


measured by the other, as yet unknown, factor f (x,, x2, t), x > 0, —oo < хә, 
t< оо. If 


J (Ci $2, @) = fi dx, / dx, dt exp (—б, x, — i$, x, + iot) f (x1, х2, t) (18) 
0 —00 


denotes the multiple Fourier transform of f(x,, x2, t), the expression for 
P(r, t) becomes . 


g(t, t) = 


i j exp (if, x, + ila x2 + its x3 — iot) 
eee eee ы ы жыш 


^ Qa GONE NE E a 


= о 


(19) 
whence, after the integration with respect to £3, it follows that 


фәт, t ) = 


© 


1 | oM? 1/2 
+ 2 (27)? {ii ESI + it, X2 T 1) == 6,2 = 24 |з] — ior} x 
X f (5, 62, 0) 46, 46, dw (20) 


where the + signs are intended for x, 2 0 and 


wN? 1/2 
wef (2) e] ж 
[4 


If the screen is rigid the conditions 


0 " хі > 0 
= (21) 
x3 =0 ga, X25 t), Xi < 0 


d 
——(ф, + Ф) 
0x3 


prevail in its plane, and there obtains from (15) and (20) a functional relation 


i QN? 1/2 Qo 
ZG. 2, 0) (2) | —i (о —£, Ucos 8) x 
2 4 Qo О? 


exp (i$, d cosec 9) 


E: — (9/U) cos 9? + sin? 900,2 +») - 22 
à (C1 S (0/0) cos 0p + sin? Q (2,2 + y?) aes, 6, ©) ( ) 


246* HAROLD LEVINE 


involving the transform 
о © 
2(,, 62, 9) = f dx; f dx, dt exp (—i6, x, —i2 хг + iot) &(ху, X2,t) (23) 


—‹со — 00 


of the normal velocity g(x;, x2, £) on the half-plane x, < 0, —со < x; < oo, 
хз = 0. The functions f'(£,, £5, œ) and g(¢,, £5, œ) are analytic in the lower 
and upper portions of the ¢,-plane, corresponding to Im ¢, 2 + Im о, and 
the former is given by (w > 0) 


2 1/2 1/2 20 1 
fi 62, o| (Z-t) -i | = : 


® Qo U? 2ni 
(о — vU cos 9) exp (itd cosec 9) 1 dr 
| [t —(w/U) cos @]? + sin? 9 (62 + y?) [(@?/в2—„?)!?-+т]!'?® т — б, 


4 


with an integration contour parallel to the real axis of the t-plane that passes 
above т = б, and between 


T=, = сов 9 + isin 9 (62 + yy”. 


It is the singularity at т, which determines the value of the integral if the 
contour is closed in the upper part of the t-plane, whence 


10, c sin 9 — iU cos 9 (6,2 + y?)!? 
4 „ [4 > Бакр йе eee, Hee a, RNID UP URN 
f, £2, ®) оо U? (2,2 + y?)*/? Ala, £5, ©) 


exp (id cosec à [(w/U) cos 9 + isin 9 (£2? + y?) ?]] 
X ————————————————————————————— 


A 24 
С, — (w/ U) cos 9 — i sin 9 (62? + y??? (24) 


AQ, 62, w) = 
о? 1/2 
== ee 2 
-[(6-) + 
w 1/2 o? 1/2 1/2 
+Z cos 0+ isina (E y^ | (Ee) -i | А о> 0 
с 
revealing ап asymptotic behavior 


Si; £5, ©) с сс, || — 00, Im n < 0 
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linked with the coordinate variation 
РО, хә, t) єс x, ^, xı > 0+. 
From the equation of balance 
ofl 1 Qo (dy V? 0g op 
—| zeo (VPF +-—{| — vc Уф — | = —0 — 25 
SEZ Ф) is) Qo УФ 2- (25) 
whose left-hand terms pertain to the acoustic energy density 
1 1 go / 9p V? 
= – 00 (V9) --—|— 
5 0o (V9) 2c ( ai ) 


and energy flux vector 
op 
5 = —00 Уф —, 
ot 


it appears that for a rigid screen the total energy radiated by the primary source 
may be written in the form 


© 


99, 
e= -f Q(x, X2, хз, £) < ax, dx, dx; dt (26) 
t 


-0 


inasmuch as the space-time integral of these terms reduces to a time integral 
of the radial component of S over a control surface of large extent. Employing 
(19) and subsequently effecting integrations with respect to X4, X», Xa, 1, ёз 
and ¢, in that order, 


Qo is exp (—il, d cosec д + if, Ut cos 9 + it; Ut sin 9 — iot) Y 
Nar Mitten oi Bx иче PST i DS MAG, d 


b; + C2? + бз? — (w/c)* 
х (б, 62, о) dt dé, 4, С; dw 


со 


Qo J w (0 — б, U cos 9) exp (—i£, d cosec 9) 


~ Quy U? J [& — (ujU) cos OF + sin? (Ly? +>) 


f (Eis £5, e) 46, d£, dw 


со 
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m Qo г wsin d+ iUcos 9 (2,2 + у?)!? 
“an eee 


w 
х exp Е 9 (© cos  — isin 9 (£;? + p») x 


x (© cos 9 — isin 9 (2? + y?)!2, 6, v) df, dw 


with 
о . 9 
(© cos #— isin 9 (C2? + y?) Čas a) = 
| Qo sin 9 — iU cos 9 (6,2 4- y?) ? 
2 20,02 зіп 9 (022 + y?) Ata 0) 


w 
х exp L $ (Zos 9 + isin 9 (,? + vy) (w > 0) 


and 


1/2 


w? 1/2 w 
А(С., o= (Z-t) ыш" +з] х 
w? 1/2 w 1/2 
x (=) -2 cos 8+ isin 8 (Ca? + 7!" | Р 


Thus 
Qo? ^?  r.. e [o sin  — iU cos 9 (£;? + y?)?] 


= ————-Re | de | 
2 (2л)? оо U* | |] : (622 + y??? AEn ш) 


О? 


—24 2, 2MJAY LB x 
x exp [24 (£7? + y?) ?] 2 (2m)? оо U* . 


o © 


wU cos 9 (62 + y)? [v sin 9 — iU cos 9 (6? + y?)*/7] 
x Im | do fih — — —À —— D > X 
J sin 9 (2,2 + y?) A(6,, о) 
x exp [24 (£7? + y?)!?] 


-%0 
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and if the variable т be substituted for £5, in accordance with 
w 
б = — 7, 
с 
the fact that 
" w 
A(t, w) = a (M (1 —+?)/? + cos 9 + isin 9 [1 — M? (1— тр 
X (M (1 — т2)1/2 — cos 9 + isin 9 [1 — M? (1 — т?)]!/2 11/2 
is purely imaginary for |r] > 1, yields 
2 2 loo Н 2 2311/2 
sin 9 — i cos 9 [1 — M? (1 — :?)] 
Q^ „аһ [an Inl a a cene 


~ Qn)? о Uc [1 — M? ( — 2]? Az, o) 


Qo? cosec # 


w 
x exp {| —2— [1 — M? (1 — 212 d —— X 
»( a! ка ) (n)? б Uc 


z cos 2 [1— М2(1—т2)]:/2 {sin $—icos @[1— M? CE 
x Im J wdw | dz 
[1 — M? (1 — т2)]2/2 A(z, w) 


w 
X exp (= [1 — M? (1— 22) г) 
Let т = sin х and, taking account of the representations 
ЖЕ w 
A(sin æ, œ) = T [М cos æ sin 9 + i (1 — M? cos? «)!'?], 


Е w 
|A(sin о, @)| = T (1 — M? cos? a cos? #)1/?, 
iv follows that 


[2 
cos? х 


e 
= £ је dT exp (27 (1 — M? cos? g)!/2 4) da 
SI Qo € а — M? cos? о)3/2 U 


1 Qo” M? (27) 
— 32m eod (1 — M32 


without dependence on the angle 9. 
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The simplicity of this result, whose interesting features are to be found in 
the variation with source speed and impact distance, points to the desirability 
of independent, albeit approximate, calculations that have a supporting role 
and illuminate the constructive basis of the radiation. A particularly accurate 
approximation scheme can be devised when the source has nearly sonic velocity 
and the constituent parts of its spectral resolution are but slowly varying in the 
direction normal to the trajectory. The interposition of a rigid and impenetrable 
screen in the path of a time-periodic plane wave gives rise to forward scattering 
at the incident amplitude level in the geometrical shadow region, and an estimate 
for the total secondary or diffraction radiation is thus provided by the overall 
change in energy of the primary source field in this specific region of appreciable 
interference. To ascertain the latter quantity the primary source wave function 
is conveniently expressed in terms of coordinates (z, r) along and normal to 
the trajectory, respectively, whence the counterpart of (15) assumes the forms 


2n 


Qo Р {2 K exp (irt cos 9. 
gy, r, 0з tmm) PIA 24y dy 
Qo 2 x Jo(r QE 
ЕЕЕ 


— Qo 2 lol 2\1/2 
= олу "n ; [5 EDI (а-мә ) 


Qo 1 
алоо c — Ut? + 0 — M?) ? y^ (28) 
4л оо [(z— Ut? + (1— M?)r?y ? 


whose intermediate versions contain cylinder function factors. 

Let the trajectory be normal to the plane of a semi-infinite screen and pass 
at the distance d from its edge, so that the shadow region corresponds to the 
assignments 


x >а, —o0 < y < о, z>0 


where r? = x? + y?. On calculating the integral of 


10 (=) 
2c? \ dt 
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over this region the instantaneous potential energy therein is: given by 


со 


Qo {2 
t i 2 d ————————————— 
Wy, (t) i 0o (= -) M | di vja [é? + a— M?) r?P? 


-Ut 


and the net change, 
4 Wrot = (С) т> Woo), 


in the course of time has thé value 


© 


Qo V 59; 109 ge 
dë f ay [ dx —— 
ral di MEE 
Qo M? 
iis CE 4d (1 — M22 


The analogous characterizations for the kinetic energy, obtained from inte- 
grals of the density + оо (V ф„)?, prove to be 


AW, -ie( 


2 £2 + (1 — M2)? r? 


-y f afo ја [2 + (1 — M?) 2] 


Weinlt) = + 00 (= 


and 


Qo 1 Qo 1 
ДИ = $ Qo | —— S e = d 
Aft Qs. 0о/ 4d(1— М?2)3/2 pum Qo 4d (1 — М?у!? 


Accordingly, the energy content of the primary source field in the shadow region 
of the screen changes by an overall amount 


Qo 


4л Qo 


=e ( Qo pu 


= 1 — M2)-3/2 
£p 2 4d ( ) eie 


ЈЕ picis a — M?)-1/2 
47 Qo 


as the entire trajectory is traversed; and if an equal amount is added thereto, 
by way of accounting for the symmetry of the scattered field on opposite sides 
of the screen, it follows that 


2 
26, ~ We (1—M2)72,  M—1, 
32z 0o d 
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in conformity with the prior determination of the total energy ascribed to 
secondary or diffraction radiation. This argument carries over without sub- 
stantive change if the trajectory has any orientation relative to the screen, and 
the same estimate obtains in terms of the appropriate impact parameter d. 


3. Dipole and quadrupole sources in uniform motion past a rigid semi-infinite 
screen 


If a point dipole with steady moment Fy moves along the rectilinear trajectory 
envisaged above for the simple source, having its axis in the plane of motion 
and inclined to the screen at an angle y, the pertinent wave equation is 


3 


(кон е 
і cos y — + sin y — | x 
ox? с2 Ot? E "n ý =) 


i=1 


х (х, dcosec 9 — Ut cos 9) d(x2) (x3 — Ut sind) (29) 


and the particular solution, o,(r, t), which specifies the resulting density varia- 
tion takes the form 


iFysin 9 С, cos y + (о — £, U cos 9)/U sin 9] sin y 
0) = | x 


о л)? е 2U J [6 — (010) cos 0]? + sin? 9 (62? + y?) - 


w — č, U cos 9 


x exp (i x, + if, dcosec ® + бх + i - 
U sin 2 


X34— ш) df, dé; do 
(30) 


where the prior notations remain in force. À homogeneous solution of (29), 
appropriate to the rigid screen at which the secondary source distribution is 
given by u(xi, x», t) 8'(хз), xı > 0, —oo < х < oo, may be written as 


esr, t) = 


i | exp (бу x, + ita x; + its хз — iot) 


— (Ол)* E2 + б? + G? — (00). "a Cos о) d£, df, абз dw 


Е foo (C, x, + ito х + i KoA — 06? — 07]? || — iot} x 


xg Cee a) хош (31) 
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with 


и(б1, 62, ©) = Је, Tix dt. exp (—іб, x, — iz x. + ior) MX, х2, t) (32) 


-00 


and the wave functions оь, o, are joint analogues of the pair advanced in (15) 
and (19) for the simple source. 
Applying the complementary conditions 


0 5 x, > 0 


© 
— (0, + @s) = (33 
0х3 ш хз=0 »(xi, х2, 1), X, < 0 l 


an equation relating the transforms u(¢,, £5, œ) and 


о oo 
*(5,, 62, w) = / dx, f dx; dt exp (—i5, х, — ib x4 + iot) »(xi, x2, t) (34) 


obtains in the strip |Im Z,| < Im œ, namely (compare (22)) 


i w\2 1/2 Fo 
269 (=) -&-te els E t, Ucos 0) x 


: cos y + [(w — б, U cos #)/ sin 2] sin y 


d = 
С, — (0/0) cos 0]? + sin? 9 (2,2 + у?) exp (© d cosec 8)— *(5,, 62, c) 


and the former, which is a regular function in the lower half of the Čı-plane, 
proves to be 


Fo w sin 9 —iU cos 0 (6,2 + у?у!? 


М 0) = V оа PAC, te) 


[(@/U) cos (9 — y) + i (£2? + y?)'? sin (9 — y)] 


X exp [ d cosec à Е cos à + isin ð (2,2 + 2)! a (35) 


Inasmuch as 
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g(r, t ) = 


eo 


_ c? (= (il, x4 + #5 x2 + із хз — iot) 
(2л)* Qo / o. 6? + 62 + £4? — (w/c)? 


-00 


ШОС, бо, e») аб, 16 ds dw 


by virtue of the connection (13) between the velocity potential and density, the 
radiated energy, calculated on the basis of the identity 


c? o? 


" )*v.eevocr. vo (36) 
о 


d 
— (3 оо (V9? +4 
ot 


and expressed in the form (3), becomes 


E= 


© 


ic? Fo w—l,Ucos # б, cosy + [Ко — 6, cos 9)/U sin 9] sin y 
NU al w [¢, —(@/U) cos 9] + sin? 9 (2,9 + y?) 
x exp (—it, d cosec 9) u(C,, С, w) 46, d$; dw 


inc? Fo = iU cos 9 (£7? + y?)!? 
шш ра ee Se аса арр 
(271)? оо U?_ о (22 + у®)!? 


о 


х (Tesco wi? yn 0—0) x 
w 
x exp (17 doot — (Ca? + ya) x 


@ 
xu (5 cos 9 — isin 9 (622 + y?) ?, С, °) 4, do 
Fo? ә м2 cosa [1 — M? cos? a sin? (8 — y)] 
з HEN | do | di =m 
Qn eter J { (1 — M? cos? аў'? 


о a) 
X exp (-2 ET (1 — M? cos? a)? à) 
2/2 


Fo? 1 — M? cos? a sin? (9 — y) 
f cos? a —————— ———————— 


EM ME da 
8л? оос? d (1 — M? cos? а)? 


о 


1 E [Гсоѕ? (9 — sin? (9 — 1 
32 n sem (ae (37) 
32 л ос? d L (1 — M??? M? (1 — M?)*/2 
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One or the other of the twin contributions vanishes when the angle between 
the dipole axis and the trajectory, 0 = #— y, assumes the particular values 
0 = 0, л/2; in the former circumstance, that is for a dipole aligned with the 
trajectory, the result may be simply inferred from that appropriate to a mono- 
pole source since coordinate and time differentiation of the respective source 
terms in the wave equation for the density variation are directly proportional. 
А different velocity dependence is indicated for the dipole with axis set normally 
to the line of motion and its salient feature at high speed (M — 1) is revealed 
by arguments involving the primary source function in the manner described 
earlier for a simple source. Thus, the density function of a point dipole travelling 
along the z-axis, with moment Fo along the x-direction, has the form 


Fy (1 — M?) x 


one Tce EY Se 22. 2 
4л c? [(z — Ut)? + (1— M2) r?] 3/2’ (r? = х? + у?), (38) 


MEA ғ, t) = 


while the concomitant velocity potential is 


Fo x z—Ut 


meo Ure- uyta- marp ©? 


Plz, r, t) = 


If the z-axis is normal to the plane of a screen occupying the half-plane 
x >d, —00 < y < со, z = 0, the potential energy of the given dipole field 
in the geometrical shadow 


Woolt) = iz xd [ vj dx [0,(2, r, t) 


alters by the amount 


A Woot = Woo) 2 Wror(—o) 


c?/ Fy V M: "o d£ 
2. -MF | ду] Xe OR а (40 
alae) ao fo ae жир 9 


F,? (1 — М?)-12 
0o c 32 ла 


that evidences a singularity of the inverse square root nature as М — 1. 
The analogous calculation in respect to the kinetic energy, 
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о со 


Welt) = F eo | dz f dy j dx [уф r, OP 


о —00 


2 x? (1 — M?Y [(z — Ut. + r?] 
= (s z) fefoje e a a 


x? (1 — М?) (z — Ut? (z — Ut)? 
r*((z—Ut ++ (1— М?) т?р е [0 01) са) 


(41) 


is facilitated by concentrating оп the dominant contribution; a rearrangement 
of the last term, viz.: 


(z — Ut)? 1 1— M? 


r «е0 + (MAr r^. r?[(z— Ut + (1— M?)r?] 


isolates a divergent — though time-independent — component which is of no 
consequence for the quantity of interest, A Wyin = И (00) — И (00). To 
characterize the singular part of the latter in the limit M — 1 the terms in 
(41) which contain a factor (z — Ut)? may be discarded and accordingly 


Fo ы dé 
РЄ = 2\2 ————————— U— 
AW ~ Feo (s 2) йш di (Л ee aii [? + (1 — M2) r?]?" i 


duplicating the magnitude of AW,,,. The total amount of diffraction or 
secondary radiation that is deduced from this measure of energy change in 
the primary source field and takes cognizance of equal forward and backward 
scattering by the rigid screen, namely 


2 (1 — M?)-1/2 


— M—1 
Qoc? | 32md 


agrees with the prediction of the exact formula (37). 

If the primary source has a quadrupole nature the analysis of diffraction 
radiation, along the lines used heretofore, becomes more complicated in detail 
rather than principle. It can be anticipated from the dipole radiation formula 
(37) that а common dependence on the impact parameter d is characteristic 
for all components of the quadrupole strength, while the corresponding velocity- 
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dependent factors are individually specified. With a view to determining this 
general feature and to achieving an economy of calculation, it will be assumed 
that the trajectory lies in a plane normal to the edge of the rigid screen and is 
symmetrically disposed relative to the latter (i.e., # — л/2 in the notation of 
sec. 2); then the wave equation for a quadrupole source moving along the 
x-direction and passing the screen (x, > 0, —œ < x; < 00, хз = 0) ata 
distance d is 


o? dT, 
) —— et 0-—— | 
ox? с? dt? 2n 0x; . 


d? 
=—— >, Qu ó(x, 4- d)ó(x;)ó(x4— Uf) 


Ox, OX; 
(42) 


where the Q;; form a symmetrical array, Qj; = Qi. The integral of (42) which 
refers to an isolated source, 


ovr, t) = 


1 [= [iči (х, — x4") + 1б) (x2 — х2) + ifs (хз — x3) — iw SO). 


(2л)* c? 612 + 65? + 64? — (w/c)? 
x ——— dx,’ dx,’ dx,’ dt’ dl, 16, dl, dw 
ox,' ox, 


is expressible in the form 


ejr, D) 7 — Ou GU x 
ке 2 
хоз 301, tat Qs) 2016 7 20507 2+0.(5 ) |x 


“ exp [ifi (x, + d) + 1б, x2 + iw (x4/U — t)] 


GP xU y NUN sa 
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es(r, t)= 


i [ exp (i, x, + iz x2 + із хз — iot) 
аа а 


о ра — E. tn md dls dL. dE do 


-%0 


(44) 


represents the wave function of secondary sources at the screen the boundary 
conditions in the latter plane imply that | 


i w\2 #27 o 
520, tso (2) -tt | ESTA x 


2 
| out +20,»&, t2 + Q22 2° +2013 G7 +20 а= + O33 (=) |; 


exp (i, d) 
коме S aic EA vius 45 
зүү” 09 
with 


SE | , (90 
4@ 69) = | dxs | 4х„йгехр it x; — хә + iot) (=) -. — (49 
-0 = о 3/ x330 


On recasting the transform relation (45) so that individual terms are regular 
in the lower and upper halves of the ¢,-plane, respectively, there follows an 
explicit representation (w > 0) 


р(ё,, 62,0) = — I [Qi (£2? + y?) + 2701262 (£2? + ayer + 026,2 + 


+ 2i Q;3(@/U) (£2? + у" +203600 + Q33 (@/U)?] x 
x (E; ае £^ ра?) + Ca? by н?г! x 


2 231/2 
exp [— (62? + y?) 4] "m 
& —i (5? l y2y? 
which, together with (44), fully determines the function о,(т, t). 

A single change in the energy balance equation (36), namely the replacement 
of F with —V . T, provides the version suited to the primary quadrupole 
source and thus the energy radiated by virtue of the interaction between the 
source field and. screen assumes the forms 
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© 


в=— | Vgs. V T dx, dx, dxa dt 


= о 


E dx? 19x ӧх,2 
dps 020, 029, 
+ 20,3 +2023 + Q33 J x 
х; 0Xs 0X5 0X4 0x3 


X (х, + d) (x2) (хз — Ut) dx, dx, dx; dt 


c? 


-—— — —— x 
(27)? оо U? 


o 


T f Qui? +20420, %2 05507 +20130,@/U+20.36.0/U+ Q33(w/U)? 
TAN 6,2 +62 + у? 
x exp (—16, d) р(С,, б, œ) dl, dC; dw (48) 
inasmuch as 
с? 


p(T, ) = — (2л) " x 


f Ča exp (itx, + ita x2 + its x3 — iot) 


m GU OS (w/e)? P(E1, 62, w) dl, 46, dl; dw. 
i cb G2 3 — 


= о 


On evaluation of the ¢,-integral in (48) and subsequent reference to (47) for 
details of the function p(¢,, £2, œ), it follows that 


л с? 1 
Ee es / шл: ый 
On) oo U* J. (С, узу? 
x [QE + y?) —2i О,» ,(,2 + у?) + Qaa £7? + 
— 2i Qia (w/U) (C5? + yP)? + 2054 C; elU + Q33(w/U)?] x 


X p (—i (02? + y?)1?, Ea v) exp [ (£2? + y?)? d]d£, do 
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1 7 do m exp [—2 (£2? + у®)'? 4] 
© 2(2л)?о„ U? Ji d : (622 + у2)2/2 


w? 1/2 
x (e) +162 ey x 


x {| —0„ (£2? + у?) + Q22 62? + 2023 22+ Озз (5) | + 


a 
+ 4(62? +| es 6 + 27. | 


1 (= a [0437 + Q127 M? sin? о] awe 1 
= —————— [———————————————— da 4- —————— X 
(2л)? оо c? dc (1 — M? cos? à)? 4 (2л)? оо c? d? 
1/2 2 
| ши {[Q33 + О: M? sin? «— О,, (1 — M? cos? à)? + 
x | ————— — — « 
Jac M? cos? a? 33 22 11 


+ 4 M? Q23? sin? а} da 


1 
= noe pn Q12? + аџз Q13? + a23 Q237) + 
o 


1 
23 [а\\ Qii? + 222 Q22? + аз» Q33? — bi; Q11 Q22 —b,3 Q11 Q33 + 


167 Qo c? 
+ b23 Q22 Q33] (49) 


with coefficients ау, bij, that are functions of М?, given by 


1 1—4 M? er 

12 оу aaa} ауз = +(1— М?) > 
М a May L (аму —1 
a = —— — кт T a == — = — ^ 
23 16 1i 2M? 
1 — (1 — М2)? х 

455 = 4 (1 — M?) 1? ae Se ) 

1—iM? 1 1—4 M? 
a33 = 0 —À— > 12S = a} 

(1 — М2)5/2 M? (1 — м2)? 


2 


M 
bys = 3 — M7)7^, bas == M8, 
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The specificity of individual quadrupole components is thus revealed in their 
velocity-dependent factors and for rapid motion in particular, it is the longitu- 
dinal quadrupole aligned with the trajectory that radiates most strongly. Since 
012, 023, 422, Ё, and b;4— 0 when M — 0, while the other factors assume 
finite values in this limit, it appears that slowly moving quadrupoles radiate 
more effectively if their axes are normal to the edge of the scattering half-plane. 
The universal dependence on impact distance in (49) contrasts with that per- 
taining to the separation from the edge of a fixed quadrupole whose strength 
is time-periodic; according to the analysis of Crighton and Leppington the 
scattered field energy of the latter quadrupole varies with an inverse power of 
its distance from the edge (that depends on thé orientation of source axes) 
provided the wavelength has a much larger magnitude. An increased radiation 
efficiency is, in fact, common to the moving (albeit steady) and fixed (though 
oscillatory) quadrupoles when one or both of the axes are normal to the edge 
of the screen. 


4. Frequency and directional resolution of the secondary fields 


The appearance of time-periodic components in the primary and secondary 
wave functions is consequent to their Fourier integral representations, though 
the manner of calculating total radiated energies adopted earlier bypasses a 
direct involvement with the spectral and directional character of the radiation. 
To examine such features the frequency synthesis 


pat, t) = = Г ét, а) exp iot) do (50) 


is given an explicit status, with the recognition that reality of the wave function 
implies a conjugate behavior 


g(r, —) = G*(r, а). (51) 
In accordance with (17) the time Fourier transform of 9,(r, t) in the case 
of a simple source takes the forms 
er, w) = 
І ү lis баа) + а би — ж) + ia Qs 5D] 
(27)? 6? + 62 + 63? — (w/c)? 


-%0 


X fr‘, х»', о) д'(хә') dx,’ х, бхз 4б, dl, d£, 
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1 ò E [ič ба) + ha (62 ха) + axal 
^ Ол) 0x; і 6? + £5? + 32 — (w/c)? 


X f(x, х»' ,w) dx,' dx;' 46, 46, з 


SREL T ы-и 


4л бхз [(х, — х)? + о — х) + $4] 


X fGu',x;,e)dx,dx, (@>0). (52) 


If x, =r sin g cos f, x; =r cos &, Хз = r sin asin f and the far-field ap- 
proximation 


d exp {i (w/c) [01 — x) + (x2 — х) + x,2]/2) 


7 


©хз [(х, — x1? + (xa — x"? + x^? 
o. _ exp (iwr/e) w | 
& i —sin a sin f ———— exp { —i — (x,' sin a cos f + x;' cos о) 
с r c 
(о r/c > 1) 
is employed, the consequent version of (52) becomes 
» o. . exp (iwr/c) w 
plr, w) ~ — i —— sin a sin B ———— —f © sin «cos 688 a, °), 
4л с r c 
о > 0 (53) 


where the multiple transform of f has arguments depending on the direction, 
but not the magnitude, of the vector r(x;, x5, хз). 
The time integral of the radial energy flux, 


09, 09, 
Qf Or , 


S, = —00 


can be expressed as a frequency integral involving the Fourier transform of Ps, 
namely 


оф(г, o 
do 


fs sim Pus [ oe ©) 


and hence, utilizing the relation (53), it follows that the whole amount of 
radiated energy is given by 
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п 2n 


ge | sin х dg f df sin? «sin? B. х 
0 


e€ ————— 
(4лс)? лс К 


со 
x f 


0 


do. (54) 


w, wW 
f| — sin «cos B, — cos a, w 
c c 


From (24) the pattern or angular distribution of diffraction radiation at a fre- 
quency w is proportional to 


2 Qo? 
i 0o? 
sin а (1 + cos В) (1 — M cos 2 sin о) exp [—2 (w/U) (1 — M? sin? о) d] 
(1 — M? sin? о) [(1 — M sin а cos # cos B? — M? sin? х sin? д sin? В] 
Qo? 


sin? х sin? В w* 


x 


Ф. w 
f| — sin «cos B, — cos a, w 
c с 


sin a (1 + cos f) (1 — M cos ? sin а) exp [—2 (0/0) (1 — M? sin? о)! d] 
(1 — M? sin? o) [1 — M sin a cos (В — 2)] [1 — M sin « cos (B + 9)] 
(55) 


To confirm the independence of e with regard to #, which specifies the angle 
between the trajectory and the screen, the result 


2n 


Í 1 + cos В 4 
(1 — M sin «cos 2 cos B)? — M? sin? а sin? 9 sin? B 
2л 1 


~ (1— м? sin? @)!'2 1 — M sin «cos д’ 


0 Š 
(56) 


obtained directly by contour integration around the unit circle, is apt; for, 
taking account of (56), it follows that 


€ 


2 A е "sin? a exp [—2 (w/U) (1 — M? sin? а) d 
Qo u fao f p [—2 (0/0) ( ) lu 
0 0 


~~ (Anc) ge (1 — M? sin? а)2/2 
1 QA? | sin? a da 
© (Any ood J (1— M? sin? a)? 


0 
1 02 м? 
32 л ооа (1— M22" 
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in full agreement with (27). 

The radiation pattern is evidently sensitive to the velocity of the source and 
the shaping factors are found in the numerator and denominator of (55), re- 
spectively, according as M — 0 or M — 1. At small velocities the directional 
features follow from the simple product expression sin @ (1 + cos f), whereas 
for large velocities the intensity is concentrated. about the plane of motion, 
а == z[2, and in particular near the directions В = à and f = 2z — 9 which 
correspond to the source's track and a reflection of same relative to the screen. 
The spectrum of the radiation is controlled by a single exponential factor, with 
a limiting frequency that depends on the velocity and impact distance. 

If the density rather than the velocity potential has a central role in the 
analysis, and 


o 
Lx 
esf, t) = — J o(r, w) exp (—iwt) dw 
2л Ed 
the time integral of the radial energy flux can be displayed in the form 


[s dt = 


utilizing the evident counterpart of the far-field transform (53) for a dipole 
source 


oo(r, o) do 
Г, o) = a 
w 


1 . ехр(їю{с) [о . Ф 
sin « sin В ————— u | — sin g cos B, — cos х, w |, w > 0, 
r c с 


5, ME 
e(r, w) ~ — i 
4z c 


where the function и is specified by (35), the radiated energy becomes 


e = 
т 2n © 
c . . . о „ w 5 
| sin ada | ав sin? asin? f f w? ju (sin aeos cos a, v) dw 
(4л)? лоо с с 
0 о 0 
(57) 
with 


2 2 


F, 
sin? а sin? B œw? = oa sin «(1 + cos В) x 
с 


o. w 
u| — sin a cos B, — cos a, w 
c c 


X (1 — M cos 9 sin о) [1 — M? sin? « sin? (8 — y) x 
x exp [22 (0/0) (1 — M? sin? a)!? d] x 
x ((1 — M? sin? a) [(1 — M sin а cos ? cos B)? — M? sin? a sin? 2 sin? i. 
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The salient features of this intensity pattern are, save for a specific dipole factor 
which depends on the angles 0 = #— y and х, shared with those of a simple 
source and the evaluation of (57) reproduces exactly the formula (37). А rela- 
tively more intricate distribution of radiation befits the quadrupole source with 
multiple components and its details are not pursued here. 


5. The effect of screen inertia or compliance 


The absolute rigidity of the screen assumed heretofore produces a stronger 
response to the moving source or primary excitation than is the case when it 
possesses a finite local inertia and becomes capable of stimulated motion. A 
study of the extent to which the diffraction radiation can be lessened by screen 
compliance thus merits attention from both the analytical and practical view- 
points. In anticipation of the extra complexity which inertial effects bestow on 
the analysis it is appropriate to commence with a two-dimensional problem, 
wherein the source has an indefinite linear extension parallel to the edge of 
the screen. 

If the screen is situated in the half-plane x, > 0, —œ < x, < оо, x, = 0 
and a simple source of strength Qo per unit length in the x;-direction travels 
along the x3-direction with a constant speed U, the requisite form of the wave 
equation (1) becomes 


(= е л eee ido. dd auc 
Sa , Xa, D) = — xa — Ut 
dx? dxa? c? эз) re з Qo ч x 


where d represents the impact parameter. The primary and secondary wave 
functions are correspondingly inferred from their three-dimensional counter- 
parts and have the versions 


со 


Qo / exp [i, x, + i6, d+ іо (x3/U—1t)] 
(2л)? оо U Ty 


—со 


9, (X1, Xa, f) = — d£, do, (59) 


9x5, X3; t) = 


W 


1 | exp [у (x, — x4") + 163 (хз — x3) — io (t—?’)] x 
(27)? 6? + 6s? — (w/c)? 


x f, t) 8(хз') dx,' ахз’ dt’ d£, а, do 


со 


_ i {= (16, x, + ifs хз — iot) 
(2л)? 6,2 + £37 — (w/c)? 


SEn o) d£, d£, deo 


-%0 
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со 


i. on eu exp fit, x, + i Koc)? — 6,2}? || — iot) f (д, о) d£, do», 
x, z0 (60) 
suited to the present circumstances. 

There is a discontinuity in the velocity potential v, and hence in the pressure 
at the screen, as for the rigid case, though the normalvelocity no longer vanishes; 
on the assumption of small amplitude motion the dynamical boundary condi- 
tion applicable at the initial position of the screen, namely 


2 


d d с 
Ro == Ф 0+, =: 9; 0—, th= a 


(Pp + Ps) > Хі > 0 (61) 


x3 ot хз=0 


merely equates the pressure difference between the faces to the surface mass 
density, о, times the normal acceleration. With this particular type of local 
interaction between the screen and the surrounding medium the functional 
equation that determines the secondary source transform f(¢,, w) turns out 
to be 


20 Qo exp (i6; d) 
Qo U? 6,27 + у? 


2 1/2 
(= = а) К(Ё,, о, OF (Ду, v) — = g(C1, o), (62) 


where 
x 
K У, =; 1 len ыйы кн ia 1 
(б, w х) + [2,2 — (оюусу?! 
х = 200/0 


and g(¢,, œ) is the transform of the normal velocity іп x, < 0, хз = 0. The 
functions f, g, analytic in the lower (—) and upper (++) parts of the ¢,-plane, 
respectively, may be individually characterized after bringing (62) to a form 
wherein all terms have a similar property; the necessary rearrangement con- 
forms with a pattern outlined earlier and yields 


i d 1 K, (iy, œ, 
FCs; o) ST = essa du ыйа ыш oe Б) = =з (63) 
oU?y б —іу (œle + у)? (wle — &)'® K (05, 0, х) 
w? w? 1/2 
U? с? 


if the factors K,(£,, œ, x), regular and non-vanishing functions of ¢,, in the 
designated parts of the plane, are such that 


K.(0,, о, х) 
К(,, о, х) = Ko.» . (64) 


x 
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It is the parameter х and thence the factors K,(¢,, œ, x) which account for 
the compliance of the screen and evidently x — 0, Ку — 1 as the mass density 
с tends to infinity. 

The relation (25) for the velocity potential of an acoustic field in the presence 
of a given primary source is again pertinent to the aspect of energy transfer 
from the latter; integration throughout the plane exterior to the screen, and 
over all time, of the term which contains the divergence of the acoustic energy 
flux vector furnishes one component 


e © 2n б 2 
£ Ја f r( cos о 
8л? c E c 


that superposes the diffraction contributions at all angles œ (x, = r cos a, 
хз = r sin а) and another, 


e = 


sin? а da (65) 


Е 0+, t) 0ф,(х;, 0—, "| 
Co} тае 
or oz 


© o 
d dy \? 
jefe. 
б 204 ot 0x3 хз=0 


that is transformed with the help of the boundary condition (61). A null value 
of the second component is indicated, since the local kinetic energy of the screen 
motions at the extremities of time enters and the excitation acquired during 
the passage of the source is ultimately communicated from the screen to the 
surrounding medium and propagates outwards. 

Thus (65) specifies the total amount of diffraction radiation which is given, 
in alternative fashion, by 


9g; 
4 dx, dx; dt 
ot 


в = —0 | 8(x, + d) 6(x3 — Ut) 


-00 


fa з ыллыы bu) dis dis dade 


&? + 6з? — (w/c)? 


* 
о 


| je (Ait, d) 
QayU* G? + у? 


f(&,, w) o? dt, dw 
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2 ow DiC n 66 
^E 2 exp (=r df (iy, а) ——. (66) 


The consequence of introducing into (65) details of the function ` 
w 
f (2 COS a, о) 
с 
provided by (63) is an expression 


Q^ М 
e = 
8л2 оо 1 — M? 


7 do w 
f[— ap ЕЕ 2) x 
cw U 


2n 
1+ cos а 2 


x |= 
E 1 — M? sin? « 


К. (у, w, x) 


EERE (67) 
K- [(о/с) cos a, w, x] 


which makes it clear that the diffraction radiation from a line source diverges 
logarithmically in the case of a perfectly rigid screen, inasmuch аз. the factors 
К. are then equal to unity. To assess the magnitude of ғ for a compliant screen 
requires explicit knowledge of these factors when ж # 0, and if it be recorded 


that . 
w 
K_{ — cos a, w, x 
c 


(with the supporting arguments for this and additional properties given in an 
appendix) there obtains 


Q? M 7 do w 
e= - ep(-2 T aao 4) х 
4л Qo IM. w U 


|К. (2, o, |? 1 1 хс 1/2 
х ———————[————————————àijii-cl-— . (69) 
1 + (Шш)? a —My^ [1 rone ( e ) ] 


At small source speeds the effective range of integration for (69) is 
0 « o <.04 and if Ud «X ог xd » 1, the approximation 


2 [1 -r(xclo) ^ — cos а 


1 — cos a 


(68) 


| Кучу, о, x)|? Сӣ = (70) 
kU 


has uniform validity, whence on expansion of the other elementary factors 
depending on М? and x U/w or x c/w it follows that 
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I: [estu e ey 2 ES 
4л Qo $ 9». 
Qo? 


= 3 M?) w exp (—2 o d/U) dw 
d wl pC2od/U) 


Q? M? М «1 
= = | (71) 
32 лоо (x d}? 41 


omitting higher reciprocal powers of x d. The requirement 


ха = 200 а> 1 
с 
is indicative of substantially more inertia іп the medium than at the screen, 
using d as a length scale for the comparison thereof, and it appears from the 
estimate (71) that the weak secondary radiation due to such a limp screen falls 
off proportionally with the square of the surface mass density. If x d has a 
small magnitude the simple approximation (70) no longer suffices and. there 
is need for representing |К. (iy, œ, x)|? over a more extensive range of fre- 
quencies, with an expected divergence of e in the limit x d — 0. 
On the assumption that M ~ 1 the appertaining form (A.19) 


х U 271-1/2 
IK. iy, о, a) = L " (=) | (72) 
w 
may be inserted in (69), whence 
F(A 
NS Qo” (4) ы 
4л оо (1 — М?)3/? 
with 
rt exp (—A c 
F(A) = [99 a = ci A cos Asi Asin A 
72 -+1 
and 
A = 2 x d (1 — М?)ї?;; 
since 
In A, 4—0 
F(A) = 


Tae ды 


the previously cited features of the radiation for rigid and limp screens carry 
over to sources in rapid motion. 
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The alternative representation, 
О? M 


4л 0o 


€ 


(1—M2)-9?. x 


oo 


; CENE ae 21/2 R 
x Re [ 2 (1-а-мә") 2DE озм а ‚ (73) 
(0. 9 M [K_ ( — (0/0) (1 — M?)"?, o, x) 


which follows from (66) on substituting for f (—iy, о), is less efficiently suited 
to estimation insofar as both the modulus and phase of the complicated func- 
tion K. (—iy, œ, x) enters and also by virtue of closely comparable magni- 
tudes for the separate parts of the integral; thus, when M <1 and xd 1 
the approximate form 


xU 
[К_ (—iy, о, x)? e—— (1 + iM), w «xU 
w 


implies a cancellation between the leading terms in (73), of order (x d)", 
consistent with the result (71). 

For a monopole source in three dimensions whose track passes symmetrically 
by the edge of a compliant screen in a transverse plane the radiation intensity 
integral is 

Qo? U 
8 = —— 
(Алс)? лоо 


n 2n oo 
sin « (1 + cos B) exp [—2 (w/U) d (1 — M? sin? «)1/? 
x зп оаа f ap f ao ( B)exp [—2 (0/0) d( 7] 
d M (1 — M? sin? o) (1 — M? sin? а sin? В) 


K, [i (o/U) (1 — M? sin? о)! 2, (w/c) cos a, о, х]? 


K.. [(v[c) sin « cos B, (w/c) cos a, œw, x] 


where 
x К_(Ё;, б, o, x) 
К, 62) 0, 4) — 1+ aa Ер teg 
Co far OSES GP Ol? Kilby ba, 


Utilizing the results 
? [sin? о + (x c/w)?]*/? — sin a cos f 


sin æ (1 — cos f) 


o. w 
K_{ — sin «cos B, — cos a, w, x 


[4 [4 


хс 1 
ы — ————————,  o«xc 
o sin а (1 — cos В). 


and 
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2 w o«xU. 


xU М «1 


Eo w 
K, (2 (1 — M? sin? «)*/?, — cos a, w, =) 
с 


it follows that for low source speeds and a limp screen 


QU F, T 1 о Xo o 
w ————— J sin? ada f sin? Bag f exp —2 — d |— — dw 
(4ле)? 700 c d E U /xcxU 
Qo? M? 
M«l xd»i, (74) 


B блоо (* d)? d’ 


which is an order of magnitude M/(x d)? smaller than the amount of radiated 
energy that obtains with a completely rigid screen. A similar reduction and, in 
particular, the proportionality of radiated energy to the square of the mass 
density of the screen, may be anticipated for dipolar and quadrupolar primary 
sources. 

Turning attention again to the line source and assuming that the latter pro- 
ceeds along a trajectory which is parallel, rather than normal, to the plane of 
the screen, the secondary source transform becomes 


Qo exp (—y d) 1 K, (eU, о, х) 
F (ly, 0) = M 
eoU t — oU (әјс— (1) (w/e + oU? K (Ls Ф, ә) 


with the same functions K, that enter into the prior expression (63) for the 
normal case. After substituting for f [(w/c) cos х, w] in (65) the radiated 
energy proves to be 


Q? М 7 do w 
= ы [—ex -25 aaa) x 
8л? 001+ M FE U 
1 4-cos« K,(oJU,o,x) |? 


(75) 


2 : (1 — M cos a)? |K_[(w/c) cos a, о, x] 


and if x = 0, or K, = 1, the equality of (75) and (67) (established by reference 
to the integral (56)) indicates that the orientation of the trajectory is without 
consequence when the screen is rigid. Inasmuch as 


(© y 1+M 
K| —, 0, х 
U 


T LFM? F 0 Ооу 


w 
N с> М «1, wcxU (76) 
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the estimate (71) for the radiated energy in the case of a limp screen is likewise 
applicable to both normal and parallel source trajectories. 
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Appendix 
The resolution of 
и х> 0 
О штен ЛЫ раар ыо ы) 
considered as a function of the complex variable ё, into a quotient 
KÇ, v, x) = ERU (A.2) 
K+ (Ë, о, x) 


whose numerator and denominator are regular (and non-zero) in the respective 
half-planes Im ¢ < е, > —e, is vital for the analysis of diffraction from a 
compliant screen; and a similar problem arises in electromagnetic field. theory 
when a boundary condition of the impedance type is enforced at the screen ^). 

Crighton and Leppington furnish details of a product decomposition for the 


closely related functions, viz. 
2 


1/2 
K, о, x) = (e -=) К, o, #) 
c 


= K_(, o, 22] K,(, o, х), (А.3) 
with a central role accorded to ће sectionally analytic functions Р, (5) that are 
involved in the additive resolution 


w? -1/2 
(2-5) ->”-©+һф 76 


2 
ы 


and have the explicit form 


2\-1/2 1/2 
*-©=—(е—= ) arctan (2 1 ‚ Р,(0)=Р.00). (A5 


с? о[с— 


А compact representation of the factor K_ is established, namely 


-io 


xP4G)/2 
Ko, = (и) E9 exp( | E (A.6) 
с, | 


[д т? — и? — w?/c? 
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w 1/2 
p= (= + 2 
с 


though the especially simple version which к | assumes if ¢ is real does not 
receive (or warrant) specific attention insofar as the asymptotic behavior of 
K_ or K_ matters. Details of such a nature, with evident relevance to the 
analysis of sec. 5, follow more readily from an initial characterization (that i is 
based on the Cauchy integral formula), 


In KC, œ, x) = In K (5, œ, x) —In K,(¢, w, x) = 


] x dz 1 T x dz 
= — н) rud rm) 
ni (22 — 002242) z—¢ 2ni) (22 — 02|с2)42) z—t 
> К: 


(АЛ) 


where the infinite contours of integration lie within the strip [Im [4 | < в and 
pass above or below the point 2 == £, as indicated. Thus, 


Ko м) = [K (E o, - (АЗ) 
апа 
Kt _ 1 К. х dz 
онн uf е 
M4 
1 

= exp (- т UC xt, x) — IE, 2) (A.9) 

with - | 
IX, x) = fin(1 ЕЕ NES 8 - (А10) 

КЕ G? — afer?) z—t с 


after deformation of the integration contour about a branch cut extending from 
z = ојс to z= œ. The functions J‘) vanish if x = 0, and, moreover, 


a "(22 — a?[c?)!/? dz 
— (I) — IO) = 2f — , (A.11) 
ox 2—0 2—{ 


о/с 


where deviations of the contour above the points 
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and 
z=€ (when ё > w/c) 


are necessary and a partial fraction development facilitates evaluation of the 


integral. 

If © is real and of magnitude smaller than w/c, 
d mix 2x€ z+ w?/c? — ё?у!? 
— (I) — JO) = — = — In E ЧЕ б) х 
dx и(ш— 6) шш? — 6) ole 62 — и? 


л б 
х (5 + arctan (we руз ; py z} (A.12) 
whence 
2/02 2)1/2 __ #\ 1/2 : 
К_(6, о, x) = (E exp (ГР) (A.13) 
wje — € 
with 
тат t+ (T? + w?/c?)1/2 a" -1 
1а шз 0 2L. 2 
f (т? + m yel п ac ) (« T E 4 ) + 
M e": А14 
= arctan (== EN zs) (z + arctan (02702 = а) (А.14) 


and the relation (68) can thus be confirmed immediately. 
If © > w/c an alteration in the final term of (A.12) is called for, viz. 


ліх 


[o] 
(I — C) = —— 
= RST 
2 __ m2/n2)1/2 2. [pot 
2x6 NC ON: c? |с?) СЕ w/c?) =) +a 
“uee ale C2 2 oe 


and it suffices to record that 


(w?/c? + x2)? —t x+ (P — w?/c?)1/2 
2 А 
. |К, w, z) E ос i: (22 Ye eo? |c?) > [a > alc. 


Another form 
° (i) — 1") 
Ox i 


mix 2x6 eiu E uela ue юс 
Ogee pA IE EVE. —————— |n ——————— 
u(u— D и (и? = o. . elc. p-— и? (02 — o? [c?y? — t 


ACOUSTICAL DIFFRACTION RADIATION 275* 


is appropriate when ¢ < —w/c and this leads to the same expression for 
|К NOS х) as furnished by (A.13); on utilizing the connection (A.8) between 
K, and K_ the property (76) is established. 
If © is purely imaginary, say ё = —iy, 
ліх 2іху x--u зр еу 


Xe oe ши) иби? + у?) ole и + у? 


; (4,2 2/^,241/2 2 271/2 
ESSE a cao na T 
p + у? w w w 


and it follows that 


K_(—iy, œ, x) = 


arcsinh (xc/o) 


(= +x? + =] ( ус J t sinh t dc ) 
=| — exp| —— ——] x 
y? + w/c? Р ло ^. cosh? т + у c/w 


1 i x nf 75 j ус : yc 271/2 
х — ——————D n" 
exp A arctan (3 3 oey (2 3! + ч + (= ) ] | 


[; х і ( (x? + 02/с2):/2 w )| 

X exp] -arctan ————————— — - | arctan ———— arctan — | |. 

2 Q?-re?[cy^ 2 y yc 
(4.16) 


With the particular assignment 
w 
у=—(0—М?уү? 
c 


there obtains from (A.8) and (A.16) 


2 


w 
K4 (Za — М?)!?, о, «) 


arcsinh (xc/o) 


ой ( 2 амар J T sinh t dt 
= — ex —— — 
P ж М ) 


o?[U? + x? E cosh? т + (1 — M?)/M? 
| ени СС pisi M ju A.17 
х ex — — arctan ——— in _ A 
i л w ( M? A M? )| ( ) 


and for M «1, o « x U, 
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2 “(С ) (= * тіп) тат а 
mæ — | — | exp — f 
xU\2 nM > cosh? т + 1/M? 


@ 
(A.18) 


Ф 
K, (<a — М?)!, о, n) 


since 


^  csinh dc aM 2 
f => — ln —, M — 0. 


d cosh? т + 1/M? 2 M 


In the limit M — 1 the exponential factors of (A.17) tend to unity and 


2 xU 29-1/2 
-+ (5 , M—1. (A19) 


w 


w 
K (i= (1 — M?92, о, J 
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Abstract 


The well-known property that networks of linear, positive resistors 
cannot amplify currents or voltages which are supplied by energy sources 
is shown to apply also to a class of non-linear, multi-terminal, resistive 
devices. The results of previous work are established and further ex- 
tended by a very simple analysis procedure. The results are relevant to 
the behaviour of integrated circuits. 


1. Introduction 


Consider a network consisting of positive resistors, diodes, transistors and 
a single voltage source, such that all currents and voltages are time-invariant. 
Then it is a well-known experimental fact that the maximum voltage appearing 
anywhere in the network is the voltage of the source. Such a property does not 
іп general hold for time-varying voltages and currents if the capacitances 
intrinsic to the transistors can play a significant role. 

This property has been discussed theoretically and has been proved to hold 
under various assumptions in-two recent papers +’). Further it has been 
generalized by Willson ?) to include the case of several voltage and current 
sources, as a result of a useful theorem due to Wolaver ?). 

In this paper we give a much simpler approach to the problem, which im- 
mediately leads to a natural extension of the domain of validity of the basic 
property to networks containing certain types of n-terminal devices. 


2. Characterization of three-terminal resistive devices possessing the non-ampli- 

fication property 

Consider first a three-terminal network of linear, positive resistors. Such a 
network is equivalent to a delta connection of resistors. If this network is 
embedded (fig. 1) in another network of linear, positive resistors (any of which 
may be zero or infinite) and if a voltage source is connected between nodes 1 
and 3, with node 1 at the higher potential, then, as is readily verified by simple 
analysis ^, the maximum and minimum node potentials of the complete net- 
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Fig. 1. A three-terminal linear resistive network with a single source, embedded in a linear 
resistive network. Open arrows denote reference current directions, solid arrows denote actual 
current directions. 


work are the potentials of nodes 1 and 3 respectively, provided no node of the 
complete network is totally isolated. Thus 


V, > V: > Ёз, (1) 
from which it follows that 
1, >0 апа I, <0. (2) 
By Kirchhoff’s current law, 
h =—@ +h), | (3) 
so that as а result of (2), 
|z] < max [/, TAM (4) 


It is clear from fig. 1 that the current Jọ supplied by the source satisfies 
21, 2|. (5) 


Further, (1)-(5) are equally valid if the source is a current source. Thus in such 
a network, neither the voltage nor the current of the source is "amplified" 
anywhere in the network. We thus refer to the non-amplification property. 
We next consider a non-linear three-terminal resistive device. By resistive we 
mean that the defining equations of the device do not involve time in any way. 
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Then, for any set of node' potentials and currents simultaneously admitted by 
the device, it is always possible to number the nodes so that (1) holds. We 
define the device to possess the non-amplification property if for any set of 
node potentials and currents admitted by the device the following conditions 


apply: 
If 

Vi >V: > Үз, (6) 
then 

І, > 0, I, <0; * (7) 

if 

Vi > И > Ёз, (8) 
then at least one of J,, J, is non-negative and J, < 0; (9) 
if 

V, > „> V3, (10) 
then Г, > 0 and at least one of L5, J; is non-positive; (11) 
if 

V, = V, = V3, (12) 


then from Kirchhoff’s current law, at least one of Г,, h, І, is non-negative, and 
at least one is non-positive. (13) 

It is clear that we can include all these particular cases in (1) and (2) provided 
we number the terminals appropriately. With this convention, the device pos- 
sesses the non-amplification property, if and only if (1) implies (2). In the sequel 
we refer to such a device as an NA device. 

It is readily verified that bipolar and field-effect transistors, as usually model- 
led, satisfy (2) when (1) holds. Thus a transistor described by Shockley's equa- 
tions, which include the Ebers- Moll model and the Early effect, is an МА de- 
vice 1). One can also readily verify that the measured characteristics 2) satisfy 
(2) when (1) holds. In this connection it is, however, necessary to consider the 
six different ways in which the emitter, base and collector can be assigned to 
the terminals 1, 2, 3. 

We now establish some simple consequences of the definition. 


Lemma 1. A three-terminal NA device is passive. 
Thus 
P=V, L + 1 + VI 


‚= (Vi — V) l + V3— Р) 2 >0 


by (1), (2) апа (3). The converse is not true; the ideal transformer i is the simplest 
counter-example. 
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Fig. 2. Equivalent circuit of a three-terminal NA device. 


Lemma 2. For any set of terminal potentials and currents simultaneously ad- 
mitted by an NA three-terminal device, there exists an equivalent circuit of 
positive resistors such that the potentials and currents are linearly related, with 
the resistances as coefficients. 

Proof. The circuit of fig. 2 is such an equivalent circuit with 


V, — V. V — V. 
к„=———>0, Ry =———- > 0, 
1, L 
by (1) and (2). 


3. Characterization of four-terminal NA devices 


We consider first a four-terminal network of linear, positive resistors. This 
is equivalent to a complete quadrilateral of resistances, some of which may be 
zero or infinite. Let this network be embedded in a network of linear, positive 
resistors and let a voltage source be connected to terminals 1 and 4 so that 
terminal 1 is at the higher potential. Then as in sec. 2, the maximum and 
minimum node potentials of the complete network are the potentials of ter- 
minals 1 and 4 respectively, provided no node of the complete network is totally 
isolated. We can thus number terminals 2 and 3 such that . 


V, > Va > Va > Va (14) 

It then follows from (14) and by considering the cut-sets indicated in fig. 3 that 
| A29. I, <0, (15) 

1, + 1, >0, 5 +I, <0. (16) 


One of the inequalities of (16) is in fact superfluous since it is implied by the 
other and by Kirchhoff’s current law. 
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=Y 
Fig. 3. Four-terminal linear resistive network embedded in a linear resistive network. 
We next consider a non-linear four-terminal resistive device. Such a device 
is defined to be an NA device if condition (14) implies conditions (15) and (16). 
If some node potentials are equal, then as in sec. 2, we must agree to number 


the terminals so that (15) and (16) hold. The details of all these cases are 
omitted for lack of space. From this definition we obtain directly 


Lemma 3. For any set of terminal potentials and currents simultaneously admit- 
ted by a four-terminal NA device, there exists an equivalent circuit consisting 
of linear, positive resistors, such that the potentials and currents are linearly 
related, with the resistances as coefficients. 
Proof. The equivalent circuit of fig. 4 satisfies the conditions, with 

V,—V, У, —– Из И. — V3 


Ю„=——>»0, К„=——;>»0, R=—— >], 
D n RER l 9 


by (14), (15) and (16). 


4. Characterization of n-terminal NA devices 


It is now clear how the considerations of secs 2 and 3 should be generalized 
to n-terminal networks. We define an n-terminal device to be an NA device if 
the conditions ; 

Vi > Vu (=1,2,...,2—1) (17) 


Fig. 4. Equivalent circuit of a four-terminal NA device. 


imply the n — 1 independent inequalities 


J 
Yn20 (7=1,2,...,р) | (18) 
К= 

апа н 
У,<0 (т=ғ +2, 3,..., п), (19) 
k=m 


where r = n/2 if n is even and r = (n + 1)/2 if n is odd. 

- That this definition is consistent with simple non-amplification properties of 
n-terminal networks of linear, positive resistors, is a consequence of the fol- 
lowing lemma. 


Lemma 5. Let an n-terminal network of linear, positive resistors be embedded 
in an arbitrary network of linear, positive resistors. Let a voltage source be 
connected to terminals 1 and n of the n-terminal network, such that node 1 
is at the higher potential. Then the other terminals can be numbered such that 
(17) holds, and (17) implies (18) and (19). 
Proof. The maximum and minimum potentials in the complete network are the 
potentials of nodes 1 and п respectively ^), so that the terminals can be number- 
ed such that (17) holds. The n-terminal network is equivalent to a complete 
n-gon of linear, non-negative resistances. Consider in this equivalent circuit the 
set of branches {| j} each of which is incident to one of the terminals 1, 2, 
..,j and to one of the terminals j + 1, j + 2, . . ., п. Denote these two sets 
of terminals by S, and S, respectively. Then the current in any member of 
{b | j} is either zero or flows from a member of S, to a member of S, 
because of (17). It then follows by Kirchhoff's current law, that the cüt-set 
formed by (b | j}, and the resistanceless branches that form the connections 
between the nodes of S}, the embedding network and the voltage source, con- 
strains the terminal currents to satisfy (18) for j = 1, 2, : . ., n — 1. From this 
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relation it follows by Kirchhoff’s current law that (19) holds for m = 2, 3, 


..., п. Since only n — 1 of all these inequalities are independent, (18) en (19) 
with their restrictions on j and m suffice. 


This lemma is illustrated for п = 5 in fig. 5, in which the actual current 
directions are indicated by solid arrows, while the reference current directions 
are indicated by open arrows. 


From lemma 5 we immediately obtain an equivalent circuit in the form of a 
linear tree. 


Fig. 5. Five-terminal linear resistive network embedded in a linear resistive network. 


Lemma 6. For any set of terminal potentials and currents simultaneously ad- 
mitted by an n-terminal NA device, there exists an equivalent circuit consisting 
of linear positive resistors such that the potentials and currents are linearly 
related, with the resistances as coefficients. 
Proof. Given (17), construct a linear tree of resistances such that the resistance 
R; is incident to nodes j andj + 1 (j = 1, 2, ..., п — 1). Then, as in the proof 
of lemma 5, the cut-set consisting of R, and the connections of nodes 1, 2, 
. S, J to any external or embedding network, ensures with Kirchhoff's current 


law that the current in R, flows from node j to node j + 1, on account of 
(18) and (19). Thus, 
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Fig. 6. Equivalent circuit of a five-terminal NA device. 


R20, ј=2,3,...,п—1. 


The result is illustrated for n = 5 in fig. 6. 


Corollary. An NA device is passive (this follows immediately from the equiv- 
alent circuit). 


5. Networks consisting of sources and n-terminal NA devices 


For п = 2, an NA device is simply a passive resistor, for example a linear 
positive resistor or a semiconductor diode. We can now state and prove the 
principal property of networks consisting of NA devices and sources. 


Theorem 1. In any network of NA devices, current and voltage sources, in 
which no node is totally isolated and for which the complete set of network 
equations possesses a solution, the magnitude of the voltage between any two 
nodes in the network is bounded above by the sum of the magnitudes of the 
voltages across the sources; the magnitude of any terminal current of any 
NA device is bounded above by the sum of the magnitudes of the currents 
flowing in the sources. 

Proof. Since the complete set of network equations possesses a solution, each 
NA device simultaneously admits a set of terminal potentials and currents, and 
all such potentials and currents satisfy the network equations. Then we can 
replace each NA device by its equivalent circuit and we obtain a network of 
sources and passive resistors. We divide all branches of this network into two 
sets. The first set consists of all resistors and all open branches between pairs 
of nodes. All such branches are passive. The second set consists of all source 
branches, some of which are active and some of which may be passive, i.e. 
these sources work as energy sinks. 

Then by Wolaver’s “two-basket” theorem ?), 
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[0,1 < X [0%] (20) 
8 ^ 


and 
[А < 2; RAR (21) 
B 


where the branch voltages and currents are denoted by U and J respectively, 
a refers to any member of the first set and f is taken over all members of the 
second set. ; 

Inequality (20) is the first part of the theorem. Furthermore, we note from 
the equivalent circuit of fig. 6 that if J, is the branch current flowing from 
node r to node r+ 1 (r = 2, 3,..., n — 1), then 


1, = — Jas 
where Јо = 0 and J, 20 (r—1,...,n— 1). 
Then 
ГА $ тах [Л J,-1); (22) 


and (21) and (22) establish the second part of the theorem. 


Corollary. The maximum and minimum node potentials in the network are the 
potentials of nodes incident to sources (the contrary assumption would imply 
a violation of Kirchhoff’s current law). 


6. Remarks 


(i) The restriction that the network equations have a solution is not trivial. 
The simplest counter example is a diode connected to a current source. The 
diode does not admit every value of the source current. However, it has been 
shown 5) that the equations of networks consisting of voltage sources, linear 
resistors, semiconductor diodes and transistors described by the Ebers-Moll 
model always possess a solution. The fact that such networks obey the NA 
conditions enables the proof of the existence of a solution of the equations to 
be shortened. At the same time the proof becomes valid for a wider class of 
devices than the strict Ebers-Moll model ?). 

(ii) Thermionic diodes and triodes are not NA devices over the complete 
range of their characteristics when the heater port is ignored. The diode when 
regarded as consisting only of the anode-cathode port, can be active since 
electron current can flow when the anode potential is lower than the cathode 
potential. Similarly, the thermionic triode when grid current is permitted can 
violate the NA condition. Even when the heater port is taken into consideration 
the thermionic diode can violate the NA condition. Thus in fig. 7, by a suitable 
choice of А > 0, we can obtain V, < V, < V, while J, >0. 

(iii) One might question whether the definition of non-amplification is un- 
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Fig. 7. A thermionic diode in a configuration demonstrating that it is not ап NA device. 


necessarily restrictive when (18) and (19) are taken as the basis of the definition. 
We might, for example, take as definition condition (17) and 


l| Sto, К=1,2,...‚п, (23) 


where Го is the current delivered by a source connected between terminals 1 
and n, while the NA device and the source are embedded in a network of 
linear positive resistors. However, we can construct a network of three such 
devices, with n > 4, and a single source, in which the source current is not 
the largest current present. This violates theorem 1, which we particularly 
want to maintain. Thus the definition would need to be sharpened and it is 
not evident that we could find a definition which admits a wider class of net- 
works than (17), (18) and (19) do, while simultaneously preserving the validity 
of theorem 1. 


7. Conclusion 


We have shown that the characterization of multi-terminal non-amplification 
devices and the establishment of the non-amplification property of networks of 
such devices is basically very simple. 

It is natural to ask what is the use of results of this kind. The author’s first 
serious contact with the non-amplification property arose in attempts to obtain 
some general results regarding the distribution of, currents and voltages in 
integrated circuits. The uniqueness of the solutions of the network equations 
of such circuits is of great practical importance, but a proof is in general very 
difficult to obtain. One can, however, often conclude something about the 
local uniqueness of solutions by investigating whether the equations have a 
solution in a particular region of ће Z-V space. An important tool in such 
investigations is the Leray-Schauder theorem and its consequences ?), which 
in the network context involve showing that at every point on the boundary 
of the region under investigation, the power absorbed by the passive devices 
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always exceeds the power absorbed by certain voltage and current sources, 
whose strengths are linear functions of the strengths of the actual sources. If all 
the resistive components of the network are NA devices, then the establishment 
of this power relation can be considerably simplified. In such analyses it may 
be convenient to split the network up into a connection of NA devices in 
various ways. The n-terminal NA devices are then appropriate. This approach 
can be useful in analysing networks containing artificial transistors each of 
which is realized as a cluster of integrated transistors. 
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Abstract 


The method of the fast fourier transform is explained and three proce- 
dures in ALGOL 60 are presented. 


1. Introduction 


Since the first publication +) on the fast fourier transform a lot of papers оп 
this subject have been published, dealing with the algorithm itself, with its 
applications and. also with hardware implementations of it. Nevertheless we 
could not resist the temptation to add another paper to the existing ones. It - 
does not contain essentially new results; its merits, if any, must originate from 
the way the method is introduced and from the simplicity of the resulting 
procedures. In secs 2 and 3 the fast fourier transform will be explained for the 
case that the vector length is a power of 2. In sec. 4 an ALGOL 60 procedure 
is presented for that case. In sec. 5 an alternative view is given, leading to a 
slightly different procedure. In sec. 6 we deal with the case of more-general 
vector lengths. Section 7 concludes the paper with some remarks. 


2. Divide et impera 


The fast fourier transform is a method of computing 
N-1 


b, = У, a, exp (2лї k j/N) {j=0MN—-1}, (D 


especially for the case where N = 2". In that case the transformation requires 
тх N operations (compared to NxN for the straightforward calculation of 
the sums). 
The essential idea of the method is rather simple and can be presented in the 

following way: 

N-1 

b, = У, & exp [2лі К j/(N/2)] 
k=0 
N/2—1 
= У, (а, + а,+ мг) exp [ri К j/(N/2)], 


k=0 


N-1 Q) 
һ»у = У, exp Qni k[N) а, exp [27i К j/(N/2)] 
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N/2—1 


= p» exp (2лї k/N) (ay — а, мг) exp [2лї k j/(N]2)], 


k=0 


| {j = 0 (1) N/2—1}. 
Introducing the vectors 


®к,о = Ak + към 
{Е =O(1)N/2—1} (3) 
Vy,, = exp Qzi k| N) (а, — а, n12) | 


we can write (2) in the form 


N/2—1 
b = vy, xp [2лї Е j/(N/2)], 
{7 = 0 (1) N/2—1} (4) 
N/2-1 wh 
һәы = 5; о, ехр [Bai k j/(N/2)]. 
к= 0 


Both right-hand sides in (4) are again fourier transforms, and we therefore have 
reduced. by (3) and (4) the problem (1) to two separate problems of the same 
form but of half the size. Each new problem can be further reduced to two 
problems with a quarter of the original size, and by repeating the procedure 
m times we arrive at N problems of size 1, i.e. N sums of one term each, which 
are trivially elaborated. We illustrate this procedure in fig. 1. 


Fa 
à 


i 
i 
i 
i 
i 
: 


Fig. 1. Successive reductions of problem size in 


For each new problem, the coefficients not only have to be calculated, but 
also to be stored. If we try to use, for that purpose, the vector whose elements 
are a, we are, by (3), invited to overwrite a, by о, о, and ак,» by vy. 
{k = 0 (1) NJ2 — 1). As a result, all the coefficients for computing 55, are 
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stored in the first half of the vector, and all the coefficients for computing 
Б... in the second half of that vector. The separation of the two new problems 
is therefore trivial and the process can be easily repeated: 


procedure divide (a, nq, p); value nq, p; integer nq, p; complex array a; 
comment a[p], ..., a[pA-nq— 1] contain the coefficients of a fourier transform of 
order nq; 
begin integer k, ng2; complex twopin, 21, 22; 

twopin:= complex number (0, 6.28318 53071 796 | nq); nq2:— nq div 2; 

for К:= 0 step 1 until ng2 — 1 do 

begin z1:— a[p+k]; 22:= a[p4-k4-ng2]; 

a[p--k]:— zl + 22; a[p--k--nq2]:— exp (kx twopin) x (21 — 22) 

end; 

if nq2 > 1 then begin divide (a, nq2, p); divide (a, nq2, p + nq2) end 
end divide; 


A simple call divide (a, n, 0) then suffices to replace the coefficients a,, stored 
in the vector elements а[0/,..., a[n — 1], by their fourier coefficients Ру. 
These b; are, however, not delivered in their natural order. All b’s with an 
even index, e.g., are to be found in the first half of the vector (this is brought 
about in the first reduction step апа not disturbed by any later reduction). 
Therefore a reordering is still required. We return to this point in the next 
section. 

We conclude this section by introducing a notation for the elements of the 
vectors in the problems displayed in fig. 1. Let for q = 2', t = O(1) m, the 
coefficients of the Ath problem on the tth level be denoted by 09%, We then 


have 
N/q—1 


buen = Y, v, exp Dei kN), (5) 
T ( —0() М4 —1,h—-0()94—1). 
By writing 
N/2a-1 
. basa = У, (of, F UN )ехр [2лї k j/(N/24)], 

k=0 

N/2q-1 

Боау+а+һ т у, exp [2лї k/(N/@)] (000 РРР )ехр рлі k jIcNI24)) 

k=0 


we obtain the recurrence relation 


(20) (а) @ 
Ven mE (о F Ve N/2a,h ) 
А (a) (4) 
20200 a = exp [2лї k/(N/q)] (vo, —– 090 v iac. n ), (6) 


{k = 0 (1) N2q4—1, h = 0 (1)g— 1}, 
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which starts with 

và = ap {k —0() N—1) (7) 
and ends with 

290 = bp {А = 0(1) N—1}. (8) 


Note that (1) and (4) are special cases of (5), for q = 1 and 2, respectively, 
and that (3) is a special case of (6), with vpn = 202), 

Each complete transformation from v{ to v2” requires, if properly 
ordered, N/2g exponentials, N/2 multiplications and № additions or subtrac- 
tions. The complete fft requires N — 1 exponentials, m N/2 multiplications 
and т N additions or subtractions. Note that in the procedure divide given 
above, however, the number of exponentials equals the number of multiplica- 


tions. 


3. Madness, yet there is method in it 


In the foregoing section we saw how in the fft algorithm repeatedly two 
elements of the vector are replaced by new values. 
For the index p(2, of the coefficient v?) in the vector a, we get, according to 


the recurrence relation (6), the following recurrence relation: 


Qa) __ (a) 
Pkn = Pk 


(20 .. a 
Dk,ha = Рк+м/2да„һ› 


{k = 0(1) №24 — 1, h —0()4—1), (3) 


starting with 


Pko = К, (k20(0)N—1) (10) 
As рд is linear in k, and (9) does not disturb this linearity, we may put: 
N 
рең = k + ri, {k = 0(1) Niq—1,h=0(1)q—1}, (1) 
q 


(for constant Л the р) form segments of length N/q) with 


s? = 212, 
{^ = 0()9— 1), (12) 
50-2109 +1, 
апа : | 
20 = 0. (13) 


The numbers гк? therefore are exactly the so-called bit-inverses of A with 


respect to q, i.e. if we write A in the binary number system as a number of 
log;g binary positions, we get г? just by reversing the bit string for h 
(compare the way in which the reversion of a string of t + 1 bits is expressed 


in terms of the reversion of strings of ¢ bits with the recurrence relation (12)). 
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At the last level, we finally have: 


b, = 10%, {h = 0(1) N—1}, 


and we will find b, in the element with index r{. We must complete, there- 


fore, the algorithm for fft by a reordering, in which the elements with index Л 
and r(" are interchanged (note that if r(? = s then r(? = h). This can 
easily be done by the call invert (a, 1, 0, 0) of a procedure invert with decla- 


ration: 


procedure invert (a, q, h, т); value q, h, r; integer q, h, r; complex array а; 
comment r contains the value of r,(?); 
begin ifq < N 
then begin invert (a, 2xq, h, 2xr); 
invert (а, 2Xq, h +9, 2xr + I) 
end 
else if h «r 
then begin complex z; 
2:= a[h]; a[h]:= afr]; a[r]:= z 
end 
end invert; 


The number of operations involved in a cali of invert is linear in N (there 
are executed, e.g., N/2 interchanges and 2N — 1 tests “qg < N”). 


4. A thing of beauty 


Below we present ап ALGOL 60 procedure (fftcom) for the fast fourier trans- 
form based on the discussions of the previous sections. The differences with 
the algorithms given above are the following: 

(1) Instead of complex arithmetic, real variables and real expressions are 
used throughout the procedure. The arrays ar and ai are used for the real and 
imaginary parts of the coefficients involved. The exponentials are replaced by 
cosine and sine functions. 

(2) Instead of recursion, iteration is applied, except in the inversion process. 
The controlled variable пд (for N/q) is used to iterate over the levels; the 
variable j iterates over the problems of one level; the variable k is used to 
iterate over the elements of one problem. By changing the order in which j 
and k are incremented (k in the outer loop and j in the inner loop) we can 
use each exponential exp (2zik[ng) q times. Moreover, for each value of k, 
j is chosen to step along the kth element of the problems rather than along 
the problem numbers themselves (these elements are found at the indices 
k + a multiple of nq). 

(3) The local procedure invert has 2 parameters instead of 4. The omission 
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procedure fftcom (ar, ai, m); value m; array ar, si; integer m; 
begin integer n, ng, ng2, j, k; | 
real twopin, an, c, 8, гі, 11, r2, 12; 
procedure invert (q, h); value q, h; integer q, h; 
begin if q <n 
then begin invert (2 X q, h); invert (2 x q, h * q) end 
else begin if h < К then | 
begin r1:= er[h]; ar[h]:= ar[k]; ar[k]:= r1; 
it:= аї[һ]; ai[h]:- ai[k]; ai[k]:= 11 


kek +1 
end 
end invert; 
n= 2 À m; 
for ng:- n, па2 while па > 1 do 
begin ng2:= па : 2; twopin:- 6.2831853071796 / па; 
for k:= па? — 1 step — 1 until 0 do 
begin an:- twopin X К; с:= cos(an); s:- sin(an); 
for j:= n — nq + k step — nq until k do 
begin г1:= ar[j]; 11:- ai[j]; r2:- ar[j + 192]; 12:= ai[j + ng2]; 
ar[j]:s r! + r2; аї[3]:= 11 12; 
Pi: rj] — r2; 11:= 11 — i2; 
ar[j + па2]:= r1 X e - i1 X s; ai[j + п02]:= r1 xs 11 Xc 
end for J 
end for к 
end for ng; 
k:= 0; invert (1, 0) 
end fftcom; 
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of the first parameter of the previous version of invert is trivial. The omission 
of the parameter r, however, is harder to explain. Its role is replaced by the 
variable k, global to invert, which is initialized to 0 and is incremented by one 
at each level of maximum depth, regardless of whether on that level two 
elements of the vector are interchanged (Л < k) or not. That this replacement 
is correct can be proved in two steps: 

(a) A call invert (l, h) increments k by NJI. 

For, this is true for / = N; moreover, if it is true for / = 20, it is true 
for l = 9, as N/2q + N/2q = №. 

(b) At procedure entrance, the value of the formal parameter r in the old 
version, and the value of k in the new version, are related by k = (№4) xr. 
For, it is true at the start, where we have invert. (a,1,0,0) in the old 
version, and the statement pair k :— 0; invert (1,0) in the new version; 
moreover, if it is true at a level different from the deepest level, it holds 
for the next level: at the first internal call, both g and r are doubled and 
the value of k is still unchanged; at the second internal call, k has been 
incremented by N/2q, and again we have k -- N/2g = (N/2q)x(2r + 1). 

As a special case, at the deepest level we can replace r by К. ~ 

With these remarks we hope to have made the procedure fftcom sufficiently 
clear. 


5. The way is the same, but the order is inverted 
There is a different scheme for fft, which can be programmed as easily as the 
scheme we just discussed. Consider the inverse transformation: 


N-1 
А =< Y, b;exp (oni ЕДЮ), (k—0()N—1, Q4 


J=0 


and apply the method of sec. 2 to it: 


i №а-1 
ашъ = 72 w® exp [—2лї k j/(N/q)], (15) 
{k =0(1) №9 — 1, А =0(1)q—1} 
with 
WEP = (WIR HWE nan) (16) 


Wara = + exp [—2л? jj i, a) (00% — WÈ ynan) 


А А {7 = 0 (1) N24 —1, = 0(1)q—1}, 
which starts with 


who = by, . F=0MN-1}, (17) 
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and ends with 


(we have honestly distributed the m factors + contained in 1/N over the 
m recurrence relations (16)). Í 

Again we can use as index for the coefficient w{? the previously derived 
Df, and we get the coefficients a, delivered at the positions r{. 

Now, by inverting the whole transformation expressed by (16), (17) and (18) 
we get a new scheme for the fourier transform: 

First, start with the coefficients a, in their normal order and reorder them 
in such a way that а, can be found at the index rf. Consider them to form 
N segments of length 1. Next, combine repeatedly two adjacent segments of 
the same length to one segment of double length according to the recurrence 


relation 


wen = Wha + wy, exp [271 jIQNJa)]. 


wi муза = ин > ЖР exp [2л j/(N/q)], 


{7 = 0(1) N/2qg—1,h=O0(1)g—1} (19) 


until one segment of length N is obtained. Then, b, = w$}, and the b, are 


delivered in their natural ordering. 

We can formulate this recombination process recursively by means of the 
procedure combine in the following way: 
procedure combine (a, nq, p); value nq, p; integer nq, p; complex array a; 
begin integer j, nq2; complex twopin, 21, 22; 

twopin: — complex number (0, 6.28318 53071 796 | nq); nq2:— nq div 2; 

if nq2 > 1 then begin combine (а, nq2, p); combine (a, nq2, p + nq2) end; . 

for j:= 0 step 1 until ng2— 1 do 

begin z1:— a[p4-j]; 22:= a[p4-j 4-nq2] x exp (jx twopin); 

a[p4-j]:—9 21 + z2; a[p-Hj--nq2]: 9 z1— z2 

end 

end combine; 


The complete fft is then carried out by the statement: 
begin invert (a, 1, 0, 0); combine (a, N, 0) end. 


Instead of this, we present the ALGOL 60 procedure fftcoml (see below), 
to which the three remarks given in sec. 4 apply almost unchanged. 

We have, from a computational point of view, no preference for one of the 
two methods for fft as embodied in the procedures fftcom and fftcoml. This 
is no longer true for generalizations of fft, as we shall see in the next section. 
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procedure fftcom! (ar, ai, m); value m; array ar, al; integer m; 
begin integer n, nq, ng9, jy К; 
real twopin, an, c, s, гі, 11, r2, 12; 
procedure invert (q, h); value a, h; integer a, h; 
begin rq <n 
then begin invert (2 x q, h); invert (2 X q, h + q) end 
else begin if h < k then 
begin г1:= ar[h]; ar[h]:= ar[k]; ar[k]:= r1; 
11:= аї[һ]; ai[h]:= ailk]; ai[k]:= i1 
end; | 
k:=k +1 
end 
end invert; 
п:= 2 А ms 
k:= 0; invert (1, 0); 
for ng2:- 1, nq while ng? <n do 
begin nq:= 2 X ng2; twopin:= 6.2831853071796 / nq; 
for ј:= ng? — 1 step — 1 until 0 do 
begin an:- twopin X j; c:= cos (an); s:= sin (an); 
for k:= n — па + j step — nq until j do 
begin r1:= ar[k + ng2]; 11:= ai[k + nq2]; 
r2:= r1 X c — i1 X 5; i2:= rixs + il Xo; 
ri:= ar[k]; 11:= ai[k]; ar[k]:= r1 + r2; ai[k]:= 11 + 12; 
ar[k + па2]:= ri — r2; ai[k + ng2]:- 11—12 


end for k 
end for j 
end Tor mae 
end fftcom!; 
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We state without proof that 
vi) = exp (2лі h|N) x wl 
forg —1,2, ..., N, к= 0 (1) N/g—1 and л = 0 (1) 4—1. 
6. Be wise, generalize 


In this section we consider the fast fourier transform (1) for the case that 
the base N is a product of m factors ру, ..., Pm, Which are not necessarily 
equal: 


N = p,Xp2X...XDm (20) 


The generalization of the formulae in the preceding sections is straightforward. 
We introduce the following notation: 


4: = р, XP2X. . .XPı-15 


{t =1(1)m} 
lt Pe = р, Хр Х...Хр,-:Хр,, 
for products of factors of N, and 
[4:] = (Pı, P2. +> Dici) 
{t = 1(1) т) 


[qs рг] = (Pis Pas « + Pio Po) 


for ordered sequences of factors of N. Special cases, for t = 1 and t = m, are: 


[gi] — = (€) 


and 
N = Gm Pm = р, Х...Хрь 


[N] = [dm Pm] = (Pis t «> Pm) 


respectively. In the following formulae, however, we shall often omit the sub- 
script ¢ and write, e.g., g in stead of q, in order to improve readability (and 
printability). 

We then have on one hand: 


N/a—i 


bath = у САП exp [2лї k j/(N/q)], 
k=0 
{7=0(1) M/q—1,h =0(1)g—1}, (5a) 


p-i 


оканда = ехр [2лї k IQN]a)] Y, visas, exp Олі g Пр), 
g=0 


{К =0(1) N/gp—-1,h—0()g—1,1—0()p—1), (ба) 
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Wü = ap {k=0(1)N—1} (а) 

VN = bp {h=0()N—1} (8a) 
N 

Pen = m {Е=0(1)М]4—1,%=0()4—1}, (ila) 


таа = P n + (—0(0)9—11—0()p—1»5 (12a) 
rj? —0, (13a) 


where we start the transformation process with the coefficients ак in their 
natural order (рф being k) and after m steps arrive at the coefficients b, 
stored at the indices гт. 


On the other hand we have: 


Work = а, {fh=0(1)N—1}, (18a) 
p-1 

ий usa = У, wierd exp Doi 1 + g N/ap)(NIa)], (19а) 
1=0 


иа = by, {j=0(1)N—1}, (17а) 


where we start the transformation process with the coefficients a, stored at 
the indices rI"! and after m steps arrive at the coefficients b, stored in their 
natural order. Again we have: 


vigi? = exp Qi k А/М) x wire? 


In (19a), the argument of the exponential is proportional to the summation 
index /. We can, therefore, easily compute the right-hand side of (19a) using 
Horner’s scheme for the evaluation of a polynomial of degree p — 1 in 
exp [2zi (j + g N/qp)/(N/q)]. The elaboration of the right-hand side of (ба), 
on the other hand, seems to require an extra exponential and an extra multi- 
plication. For that reason we have chosen the second scheme (18a), (19a), (17a) 
for our procedure gftcom. 

The body of the procedure gftcom is composed of three inner blocks. The 
first of these blocks factorises the parameter n into its prime factors and delivers 
both the number of factors (in m) and the factors themselves (in fH], .. ., 
f[m]). 

The second inner block of gftcom permutes the input coefficients. This is 
more complicated than before since the permutation now usually contains cycles 
of length larger than 2. For simplicity we first generatethe permutation and store 
it in the array rk (in such a way that rkr] = h). Тегене; all the cycles 
are applied once and once only. 

The third inner block carries out the т successive recombinations. The variable 
ngp takes on the values 1, Pms PmPm-1> +++» Pm+++P23 the variable ng as- 
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procedure gftcom (ar, ai, п); value п; integer п; real array ar, ai; 
begin integer m, p, nq, пар, t, 3. J, К, 1, jenap; 
real twopin, an, е; 5, W, wr s Wi; 
integer array f[! : 1n (n) x 1. в]; 
begin m:= 0; па:= n; t:- 
for p:= 2, 5, 5 step t until пар до 
begin try: пар:= "nq : p» 
if p X nap = nq In]: 
then begin m:- m 4 1; f[m]:- p; nq:= пар; goto try end; 
tiz6—-t 
end for p; 
if ng > 1 then begin m: m + 1; f[m]:- па end 
end : factorization of n; 
begin integer array rk[o : n — 1]; 
“procedure gen perm (a, n); value а, h; integer q, h; 
begin integer 1; 
TIF а<п 
then begin t:= t + 1; 
for 1:= 0 step 1 until f[t] — 1 do 
gen pern-(F ft] xq, h+1xq); 
t= + —1 
end 
else begin rk[k]:- h; k:- k + 1 end 
end gei gen perm; 
Kis t:- 0; gen perm (1, 0); 
for k:- 1 step 1 untiln — 2 do 
begin ј:= rkTk]; 


SEN Я [3]; [3]; rk[k] 
then begin wr:= ar[j]; wi:= ai[j]; rk[k]:- К; 
for 1:= rk[j] while 1 4 j do 
i Begin п ar[j]:= аг[1]; ail §]i= ei[1]; rk[j]:= J; J:= 1 ends 
агіјі:= чт; а]: wi 
end j tk 
end for k 


end p permutation of elements; 
begin real array r, i[O : fim) - 1]; 
t:- m; 
for ngp:- 1, nq while nap <n do 
begin p:= f[t]; t:- t — 1; nq:= пар X p; twopin:= 6.2831853071796 / na; 
for k:= 0 step nq until n — nq do for j:= 0 Step 1 until пар ~ 1 do 
begin 1:= р ~ р=1; 
for jgnqp:- j + па — пар step — пар until j do 
begin г[1]:= ar[jgnap + К]; 1[1]:= eiljenap + ЖОК]; 1:2 1— 1 end; 
for jgnqp:= j + nq — пар step — пар until j do 
begin an:= (if jgnap < ng/2 then јепар else Jjgnap — nq) x twopin; 
c:- cos (sn); s:= sin (an); 
we:= r[p — 1]; мї:= ilp 1); 
for l:= p— 2 step- 1 until O do 
Begin мг1:= wr; 
-wixc-wixs + r[1]; i: wI Xs +w X c  i[1] 
ind for 1; 
ar[jgnqp + k]:- wr; at[jenqp + k]:- 
end for jgngp 
end f for k k and for j 
end fi for ne nqp 
end г recombinations 
end g gftcom; 
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sumes the values Pm, Pu Pu-1; +++» №; the variable k runs over the values of 
ngxh {h = 0 (1) p — 1) and the variable jgngp accepts the values j + gxngp 
{g = 0(1) p— 1). For each value of j and h (or, equivalently, j апа К), we 
have, according to (19a), to replace p elements of the vectors ar and ai by 
new values. For that purpose, these elements of the vectors are first copied into 
two working vectors r and i; next the new values are computed using Horner's 
scheme and then stored in the right locations in the arrays ar and ai. In the 
computation of the exponential, the argument 2z xjgnqp[ng is replaced by 
2л x (jengp[ng — 1) if it is larger than x, for higher accuracy. 


7. Discussion 


In the preceding sections we discussed two sets of formulae for the fast fourier 
transform: in sec. 2 we introduced the so-called Sande-Tukey scheme ?) and 
in sec. 5 we dealt with the Cooley-Tukey version +). Both methods were first 
presented for the special case that the vector length is a power of 2, and were, 
in sec. 6, generalized to arbitrary vector lengths. 

There are a number of refinements that we did not discuss so far, for reasons 
of simplicity: 

(1) We did not discuss the reduction in the number of multiplications that 
can be obtained when exploiting symmetries in the trigonometric functions or 
when combining factors 2 to factors 4 as much as possible. According to the 
algorithm gftcom of sec. 6, each factor p, of N leads to NX(p, — 1) complex 
multiplications. Therefore two factors 2 lead to 2N complex multiplications 
and one factor 4 to 3N ones. Using the special values of exp (2zi k/2) and 
exp (2zi k/4), however, these numbers can be reduced to № and 3/4, re- 
spectively. For odd factors p, one can, using the fact that cos [2x (n — k)/n] 
= cos (2x k/n) and that sin [27 (n — k)/n] = —sin (2z k/n), reduce the number 
of complex multiplications by a factor of between 2 and 4 (see e.g. ref. 4). With 
the help of these additional provisions one can speed up the fast fourier trans- 
form, though not essentially. The result is a much longer procedure, which is, 
at least from the viewpoint of program design, less attractive. 

(2) We did not discuss the implications of the use of virtual memory systems 
(see e.g. ref. 3). Note that the recursive procedures divide (sec. 2) and com- 
bine (sec. 5) lead to memory accesses of a more systematic order than that of 
the iterative set-ups in fftcom and fftcoml. 

(3) We did not discuss how one can reduce the amount of additional working 
space in the procedure gftcom of sec. 6, which is, during the permutation of the 
elements, of order N. If N can be factorized in the form of a palindrome, i.e. 
N —p,Xp3X...Xpg With p; = Pm+1-1 {i= 1 (1) m}, then all the cycles in the 
permutation are again of length 1 or 2 and we can again use the method from 
the procedures fftcom and fftcoml to perform the permutation. This is based 
on the fact that quite generally it holds that: 
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г(?ї' Pm) =ke к?т, Р) =h. 


Singleton °) has described how square factors can be used to reduce working 
space and at the same time improve the regularity in the order of memory 
accesses during the permutation of the elements, its costs being the necessity 
of moving the elements more than once. 

(4) We did not discuss applications to real transforms, convolutions and 
autocorrelations. 

All these aspects can be easily analysed from the papers cited. Our presenta- 
tion has been influenced by a paper of Hopcroft 5), dealing, however, with 
complexity analysis rather than with fast fourier transforms. 


REFERENCES 


J. W. Cooley and J. W. Tukey, Math. Comput. 19, 297-301, 1965. 

W. M. Gentleman and G. Sande, Proc. AFIPS 1966 Fall Joint Comput. Conf., Vol. 29, 
pp. 563-578. 

R. C. Singleton, Comm. ACM 10, 647-654, 1967. 

R. C. Singleton, Comm. ACM 11, 776, 1968. 

5) J. E. Hopcroft, Complexity of computer computations, in J. L. Rosenfeld (ed.), 
Information processing 1974, North-Holland Publ. Co., Amsterdam, 1974. 


R 908 Е Philips Res. Repts 30, 302*-315*, 1975 
| Issue in honour of С. J. Bouwkamp 


THE N-PORT RECEIVING ANTENNA 
AND ITS EQUIVALENT ELECTRICAL NETWORK 


by A. T. DE HOOP *) 


Delft University of Technology 
Delft, The Netherlands 


(Received January 17, 1975) 


Abstract 


A. rigorous proof is presented of the commonly accepted theorem that 
an N-port receiving antenna is, in several respects, equivalent to an 
N-port electrical network containing internal sources. Expressions are 
derived for the quantities that specify the Thévenin representation of this 
network. As a basic tool, the reciprocity theorem that relates the electro- 
magnetic fields occurring in the transmitting situation to those occurring 
in the receiving situation, is used. The most general N-port antenna, if 
only linear, passive and time-invariant, is investigated; nonreciprocal 
ones are included as well. The incident radiation consists of an arbitrarily 
elliptically polarized plane wave. With the aid ofthe equivalent network, 
the condition for maximum power transfer from the incident wave to 
an N-port load is derived. 


1. Introduction 

The electrical properties of a receiving antenna, for example one that is used 
in а communication system, are often specified more or less intuitively in terms 
of an equivalent electrical network with internal sources +). The parameters of 
the relevant network are commonly accepted to follow from related quantities 
that characterize the same antenna in the transmitting situation, while the 
strengths of the internal sources will depend on the amplitude, phase and state 
of polarization of the radiation that is incident upon the antenna in the receiving 
situation. ; 

The purpose of the present paper is to show how this representation can be 
justified rigorously. At the same time, expressions are obtained for the network 
parameters involved as well as for the strengths of the internal sources. As a 
result, the electrical network that characterizes the properties of the antenna 
in the receiving situation is completely specified. The conditions under which 
the representation is shown to hold. are: 

(a) both in the transmitting and in the receiving situation the antenna is acces- 
sible at a finite number of ports at which either the low-frequency voltages 
and currents replace the general field concept or the single-mode waveguide 
description (as in a microwave antenna) holds; 


*) Dedicated to my friend Dr С. J. Bouwkamp, whose penetrating way of thinking has had 
a substantial influence on my scientific development. 
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(b) in the receiving situation the incident electromagnetic radiation consists of 
a uniform plane wave (with arbitrary amplitude, phase and state of polar- 
ization); 

(c) the antenna system is linear in its electromagnetic behaviour. 

No further restrictions are imposed. In particular, the material of which the 

antenna is made, may be lossy, inhomogeneous and anisotropic; nonreciprocal 

antenna configurations are included as well. 

The theorem upon which our considerations are based is the reciprocity 
theorem that relates, for a single antenna, the electromagnetic fields occurring 
in the transmitting situation to those occurring in the receiving situation. This 
theorem has been derived by the present author on a previous occasion 2) 
and is shortly reconsidered here. Subsequently, it is applied to the configuration 
under investigation, upon which the desired network representation is obtained. 

Once the equivalent network pertaining to the receiving situation is known, 
several problems related to the further use of the antenna can be solved by 
employing pure network methods. Among this is the problem of maximum 
power transfer from the incident wave to a load that is connected with the 
accessible ports. We show that maximum power transfer occurs if the imped- 
ance matrix of the load is the complex conjugate (not the Hermitean con- 
jugate) of the input-impedance matrix of the antenna in the transmitting situa- 
tion. The load is then “matched” to the antenna. 

For recent developments in the network-theoretical aspects of the scattering 
properties of an N-port receiving antenna, we refer to papers by Mautz and 
Harrington ?) and Harrington and Mautz ^). 

Network-theoretical aspects of minimum-scattering antennas are dealt with 
in papers by Kahn and Kurss 5) and Wasylkiwskyj and Kahn 5). 


2. Description of the configuration 


The antenna system under consideration occupies a bounded domain V in 
space. Externally, V is bounded by a sufficiently regular closed surface So; 
internally, V is bounded by a sufficiently regular closed surface S,. The sur- 
face S, is considered as the termination of the antenna system, and on it a 
finite number N of ports is defined through which the antenna system is 
accessible from the “interior” (fig. 1). Parts of Sọ and S, may coincide. The 
region V thus introduced allows us to distinguish the antenna system from the 
environment into which it radiates or scatters, as well as from the terminals 
at which it is accessible. The cartesian coordinates of a point in space are de- 
noted by x, у and 2; the time variable is denoted by г. The position vector is 
denoted by r. The electromagnetic fields occurring in the transmitting situation, 
as well as those occurring in the receiving, situation, are assumed to vary 
sinusoidally in time with the same angular frequency w. The complex represen- 
tation of the field vectors is used, and in the formulas, the complex time factor 


А 


304* A. T. DE HOOP 


environment : leg, t} 


Z 
(n 


ng 
V accessible ports 


Fig. 1. Antenna configuration with N accessible ports. 


exp (—iwt), common to all field components, is omitted. 

The antenna consists of a medium, the electromagnetic behaviour of which 
is linear and passive, no further restrictions as to its electromagnetic properties 
being imposed. The properties of the medium may change abruptly when cross- 
ing a (bounded) surface, but, across such a surface of discontinuity in properties, 
the tangential parts of both the electric- and the magnetic-field vector are 
continuous. Other parts of the antenna system may consist of conducting sur- 
faces. These surfaces are assumed to be electrically perfectly conducting, and 
on them the tangential part of the electric-field vector vanishes. 

The medium outside S, is assumed to be linear, homogeneous, isotropic and 
lossless, with real scalar permittivity eo and real scalar permeability шо; this 
includes the case of free space. 

In the following sections, E, H, D and B denote the space- and frequency- 
dependent complex representations of the electric-field vector, the magnetic- 
field vector, the electric-flux density and the magnetic-flux density, respectively. 
All quantities are expressed in terms of SI units. The superscripts T and К are 
used to denote the transmitting and the receiving situation, respectively. 


3. The antenna in the transmitting situation 


In the transmitting situation (fig. 2) the accessible ports of the antenna are 
fed by an N-port source. Let 7,” denote the electric current fed into the nth port, 
and let V," denote the voltage across the nth port (n — 1, ..., N). (In the 
single-mode waveguide description, J,” and V, denote the complex amplitudes, 
in a chosen transverse reference plane, of the transverse parts of the magnetic 
and the electric fields pertaining to the waveguide mode under consideration; 
the relevant transverse modal functions should be properly normalized.) Аз a 
consequence of the uniqueness theorem of electromagnetic fields, the voltages 
{V,7} are linearly related to the currents {7,7} through the relation 


N ; 
Van = У Zma” I (m = l, Өө» N), (1) 


n=1 


where Z,,,,!" defines the input-impedance matrix of the radiating antenna system 
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transmitted wave: (E, HT) 
же! 


environment: (ej, ty) 


Fig. 2. Antenna in transmitting situation; the accessible ports are fed by a source. 


in the transmitting situation. The time-averaged electromagnetic power Різ fed 
into the antenna system in the transmitting situation is then given by 


nad e| à VAT i| Q) 


m=1 


where * denotes the complex conjugate. With the aid of (1), eq. (2) can be 
rewritten as 


N N 
ріп — 4 ref 3: Y Zi E I* 1 г] 
m=1 n=1 


NON (3) 
—LY 3 Gaal Za ta ТТ. 


т=1 п=1 
In the domain У, between So and 5;, the electromagnetic-field vectors satisfy 
the source-free electromagnetic-field equations 
curl НТ + iw DT = 0, (4) 
curl ЕТ — iw ВТ = 0, (5) 
and the constitutive equations which express (D7, B") linearly in terms of 


{ЕТ, НТ). Owing to the continuity of the tangential parts of E and HT across 
S, we can rewrite the expression for P!^ as 


накан”) аа ©) 


where n denotes the unit vector along the outward normal. 
In the domain outside Sp, ET and Н? satisfy the source-free electromagnetic- 
field equations 
curl H? + iw £ ET = 0, (7) 


curl ЕТ — iw uy HT = 0. (8) 
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In addition, the transmitted field satisfies the radiation conditions 78:9) 


EXHT? -F(eo/uo)!? ET = O(r^?) as r— 00, (9) 
T XE? — (uo/ceg)? HT = O(r-7) as  r— о, (10) 


| jn which r(= r/r) denotes the unit vector in the radial direction and r is the 
distance from the origin to a point in space. As a consequence of eqs (7)-(10), 
the following representation holds: 


{ЕТ (ть), H7(r9)) ~ (e" (£5), h7(Fp)} exp (i ko rp)/4 arp as  "rp— oo, (11) 
in which 
kg = w (£g uo)! ? = 2z[Ao, (12) 
До being the wavelength in the medium outside Sọ. P denotes the point of 
observation with position vector rp. Between e" and h? the following relations 
exist: 
eT = (uo/éo)*/? (ВТ x15), (13) 
h? = (eo/1o)!? (Fp хет). (14) 


These relations, together with eq. (11), are in accordance with eqs (9) and (10). 
We shall denote eT = e7(Fp) as the electric-field and h” = h"(r») as the magnetic- 
field amplitude radiation characteristic of the antenna system. For a given N- 
port antenna, they only depend on the direction of observation and on the way 
in which the N accessible ports are fed. Both amplitude radiation characteristics 
are transverse with respect to the direction of propagation of the expanding 
spherical wave generated by the antenna, i.e. fp. e" = 0 and fp. h" = 0. We 
exhibit the dependence of eT and h? on the way in which the N accessible 


ports are fed, by writing 
N 


{ет, h7} = у {e,7, b,” } 1,7. (15) 


п=1 


The time-averaged electromagnetic power P7 radiated by the antenna is given by 


Pr = gRe{ Sf Æ) naa |, | . (16) 
So 


where n denotes the unit vector along the direction of the outward normal. 
Since the medium outside Sp is lossless, we can replace So in ед. (16) by a 
sphere whose radius is taken to be so large that the representation (11) holds. 
Then, we can rewrite eq. (16) as 


1 £y M7? 
o 
РТ = — eT. eT* dQ, (17) 
2 
32 л Ho o 
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where 2 denotes the sphere of unit radius. Incidentally, eq. (17) proves that 
РТ > 0 for any nonidentically vanishing eT. With the aid of (15), eq. (17) can 
be rewritten as 


Pray У +7 "x JN | ет е,7* .е„Т dQ. (18) 


т=1 п=1 


Now, for a lossless antenna we have РТ = P", In this case the right-hand 
sides of (3) and (18) should be equal, irrespective of the values of {7,7}. This 
condition leads to the relation 


iQ T Zi Urb) — 32 л? - y JE m тж e d dQ. (19) 


A relation similar to (19), has been derived by Van Bladel !?). 

It is known that eT = еТ(Р„) and hT = ҺЇ(ТЬ) can be expressed in terms of 
the values that the tangential parts of ET and HT admit on Sọ; the relevant 
expression for е? is ?) 

eT = iko fp X f f n x E"(r) exp (—i ko £p .r) dA + 


So 
— iko (uo/ £9)? Ёр х LE f[[ nxH'() exp (— Ko Ip . r) aa (20) 
So 


Our proof of the reciprocity relation is in part based on this expression. 


4. The antenna in the receiving situation 


In the receiving situation a time-harmonic, uniform, plane electromagnetic 
wave is incident upon the antenna system, while the accessible ports are con- 
nected to an N-port load (fig. 3). As incident field {Е!, H') we take 


E! = A exp (—i ko & . r), (21) 
Н! = (€0/o)*/? (Ax a) exp (—i ko а. r), (22) 


where A is a constant, complex vector that specifies the amplitude and the phase 
of the plane wave at the origin, as well as its state of polarization, and —a 
denotes the unit vector in the direction of propagation. (We call « the direction 
of incidence.) The state of polarization is, in general, elliptic, but is linear if ` 
А X A* = 0 and circular if A. A = 0. Since the wave is transverse, we have 
a. A = 0. In the domain outside So, the scattered field (E*, H*) is introduced 
as the difference between the actual (total) field (EF, H*) and the field of the 
incident wave {Е!, H'}: 


def 


E ЕКЕ, Н: нн! (23) 


Y 
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scattered wave: [ЕЗ НЗ} incident wave: (E! Н!) 


environment: {Eg нд] 


59 


Fig. 3. Antenna in receiving situation: а uniform, plane, electromagnetic wave is incident 
on it and the accessible ports are connected to a load. 


In the domain outside Sp, both the incident and the total fields, and hence 
the scattered field, satisfy the source-free electromagnetic-field equations 


curl Ні + iw e, E's = 0, (24) 
curl ES — iw uo H's = 0. (25) 
In addition, the scattered field satisfies the radiation conditions 
IX HS + (e9/o)!/? Е° = O(r-?) as  r— oo, (26) 
EXE — (uo[eg)!? HS = О?) as г— о. (27) 


As a consequence of eqs (24)-(27) the following representation holds: 
(E*rp), H(rp)} ~ (e*t), h(£5)) exp (i ko rp)/4 n rp as Гр —> 00. (28) 


Between e? and hs, relations of the type (13)-(14) exist, and for e, а representa- 
tion similar to (20) can be obtained. 

The time-averaged power P^ = P¥(a), received by the antenna system, is 
given by 


= 3 Re p [ Bx H**) . n вл) (29) 


So 
Further, the scattered power Р° = P*(a) is defined as 


р: 5 Ве [s ŒxH*). n ал (30) 
50 


Since the medium outside Sp is lossless, we can, on account of (28), rewrite 


(30) as 
"wa ene o 
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Next, we add (29) and (30), subsequently use (23), (21), (22) and the represen- 
tation of е° similar to (20), and observe that the incident wave would in the 
absence of the antenna dissipate no power when travelling in the domain inside 
So. This procedure leads to ; 


PR + P5 = 1 Ке [(iw Ho)" А*. e'(—a)]. (32) 


Equation (32) is directly related to the “cross-section theorem” in electro- 
magnetic scattering 11:12), 

In the domain У, between S, and S,, the electromagnetic-field vectors satisfy 
the source-free electromagnetic-field equations 


curl Н“ + iw DF = 0, (33) 
curl E^ — iw B^ = 0, (34) 


and the constitutive equations which express (D^, B®} linearly in terms of 
(E^, Н“). The time-averaged electromagnetic power P. = P'(a) dissipated 
in the load is given by 


Pi = аве аен) naa | (35) 
51 


Owing to the continuity of the tangential parts of Е“ and Н“ across S}, we 
can express P^ also in terms of the electric currents {7,8} flowing into the 
load and the voltages (V,*) across the ports of the load. The result is 


N 

Р! = } Jl È Vn? ae (36) 
m-1 

Since the electromagnetic properties of the N-port load are assumed to be 

linear, the voltages {V,,*} are linearly related to the currents (7,*) through the 


relation 
N 


Vn =F Zm IE  (m=1,..., N), (37) 


n=1 n 
where Z,,," defines the impedance matrix of the load. With the aid of (37), 
eq. (36) can be rewritten as 


N N 
PŁ —4Re | Y УЛ” nA 2l 


m=1 п=1 
N N 
= i by ` (Zma + Zam”) 2.8 [л (38) 
т=1 п=1 


Now, for a lossless antenna we have Р“ = PF. In this case we obtain from 
(31), (32) and (38) 
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N 


i» S aas OLLI 


N 
итам Й TES dam 


= 4 Re [(io uo)" * A* . e(—)]. (39) 


5. The reciprocity relation 


The starting point for the derivation of the reciprocity relation is Lorentz’s 
reciprocity theorem for electromagnetic fields +°). This theorem can be applied, 
provided that the electromagnetic properties of the medium present in the trans- 
mitting situation and those of the medium present in the receiving situation are 
interrelated in such a way that, at all points in space, the relation 


E” . DF — E£ . DT —HT.B* + Не. B' = 0 (40) 


holds. In the domain outside Sọ this is obviously the case as the constitutive 
equations here are simply D = e; E and B = uo H, both in the transmitting 
and in the receiving situation. In the domain V, between S, and S,, the situation 
may be more complicated. Here, eq. (40) holds without change of properties 
of the medium when the medium is reciprocal. In all other cases, the medium 
is nonreciprocal, and the appropriate change in properties has to be made 
when switching from transmission to reception and vice versa. It is noted that, 
in the general condition (40), nonreciprocal media, including those showing the 
magnetoelectric effect 14), are included. If (40) is satisfied, Lorentz's theorem 
states that 


ff (ETXxH* — EFXH7T).ndA = 0 (41) 
S 


for any sufficiently regular, bounded, closed surface S, provided that the domain 
bounded by S is free from electromagnetic sources. If (41) is applied to a domain 
outside Sp, we may, on account of eqs (23)-(25), also replace (E^, H?} in 
(40) and (41) by either {Е°, H:) or (E', Hi}. 

Let S, denote the sphere with radius r and centre at the origin, where r is 
chosen so large that S, completely surrounds So. Since the fields (E7, НТ} 
and {Е°, H°} both satisfy the radiation conditions (eqs (9) and (10) and (26) 
апа (27), respectively), we have 

lim ff (ETXHs: — ESXH7).ndA = 0. (42) 
r0 S. 
Consequently, the application of Loientz's theorem (41) to the domain bounded 
internally by So and externally by S,, and to the fields (E7, НТ} and (E^, H°} 
leads in the limit r — oo to 
/] (ЕТ хН* — EXH7).ndA = 0. (43) 


So 
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Next, we observe that, from (20), (21) and (22), it follows that 
[f @*xH' —Е!хНТ).п аА = (iw uo)7* A. eT (a). (44) 


So 
On adding eqs (43) and (44), and using (23), we arrive at 
ff (7x HR — Е®хНТ).п dA = (iw uo)7* A . e7 (a). (45) 


So 
We proceed with the application of Lorentz’s theorem (41) to the domain V, 
bounded internally by S, and externally by S, and to the fields {E7, НТ} 
and (E^, НЕ}, which gives 


[[(ETxH* — E*xH7).nd4 = ff (ETXxH* — ERXH7).ndA. (46) 


S1 So 


However, on S, the field description in terms of the N accessible ports holds. 
Taking into account the direction of n and the chosen directions of the currents 
(1,7) and {1,5}, we can rewrite the left-hand side of eq. (46) as 


N 
ff (ETXx H® x: ЕЁ xH’) -nddA = — У; (S LE T Vut In”). (47) 
51 т=1 | 
Combining eqs (45), (46) and (47), we arrive at the amplitude reciprocity rela- 
tion (cf. ref. 15) 


N 
У Vim” IP + V,F 1,7) = —(io ug)? А. e(a). (48) 


т=1 


This relation will serve as the starting point for the derivation of the network 
representation of the antenna in the receiving situation. 


6. The equivalent network for an N-port receiving antenna 


In the reciprocity relation (48) we substitute eq. (1) in the left-hand side and 
use (15) in the right-hand side. Rearranging the result we obtain 


N N N 
25 ( È Zam" 1,8 + nh) I," = —(io uo)? У, А. e," (0)] L,". (49) 


т=1 \п=1 m=1 


This equation should hold irrespective of the values of {J,,7}. As a con- 
sequence, it follows that 

N 

Y Zim? 18 + Га" = Es (n= 1,.. N), (50) 


n=1 
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in which the “equivalent electromotive force” E,,* = E,,*(a) is given by 
E,R = —(iw uo)" А.е) (т = 1,..., N). (51) 


Equation (50) describes the properties of an N-port electrical network with 
internal voltage sources (Thévenin representation). The N-port network whose 
network equations are (50), is the equivalent electrical network for the antenna 
in the receiving situation. Equation (51) shows how E,,* depends on the ampli- 
tude, the phase and the state of polarization of the incident wave and, through 
the geometrical factor e,,7(«), on the direction of incidence. Note that the 
internal-impedance matrix of the network is the transpose of the input-imped- 
ance matrix of the antenna in the transmitting situation. 

One application of eqs (50) and (51) is the experimental technique for deter- 
mining the electric-field amplitude radiation characteristics (e,,7(«)) of a given 
N-port antenna, in the transmitting situation, by using the antenna as a receiving 
antenna and measuring its reaction on an incident plane wave. А simple pro- 
cedure performing this, runs as follows: 

(a) a specific value of « and two different values of A, corresponding to two 
independent states of polarization, are chosen (the values of A can be deter- 
mined experimentally by removing the antenna and measuring E/ at the 
origin of the coordinate system, the latter point serving as a reference point 
for the phase of the fields); 

(b) all ports are left open, i.e. ҺЕ = 0 for all n, and У, is measured for all m 
(in this case V, equals E,F as (50) shows); 

(с) from eq. (51) we calculate the two complex components of e," at the 
selected value of « (note that both A and e,,7 are transverse with respect 
to a); 

(d) new values of « and A are selected, and the procedure is repeated until 
enough values of {e,,7(«)} have been obtained. 


7. The condition for maximum power transfer from the incident wave to the load 


Let now an N-port load be connected to the accessible ports. Then, the rela- 
tions (37) hold. For any given load, substitution of (37) in (50) leads to a system 
of N linear, algebraic equations, from which the currents {1,6} can be deter- 
mined. Once this has been done, the power PF. dissipated in the load can be 
calculated from (38). 

One of the problems associated with the use of a receiving antenna is to design 
such a load that, for a given antenna system, maximum power is transferred 
from the incident wave to the load. On account of the relations (37) and (50), 
this is a pure network-theoretical problem, and it has a unique solution. The 
answer is that the impedance matrix of the load should be the Hermitean con- 
jugate of the internal-impedance matrix of the electric network under consider- 
ation. For reference, a proof of this statement is included in an appendix. А$ 
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the internal-impedance matrix of the equivalent network for the antenna in the 
receiving situation is the transpose of the input-impedance matrix of theantenna 
in the transmitting situation, maximum power is dissipated in the load if 


Zmn = Zaat (тп = 1, ..., №). (52) 


Hence, for maximum power transfer, the impedance matrix of the load should 
be equal to the complex conjugate (not the Hermitean conjugate) of the input- 
impedance matrix of the antenna in the transmitting situation. If (52) holds, the 
load is called “matched” to the antenna. Apparently, the condition for matchin g 
is independent of the direction of incidence a. 


Appendix 
The maximum-power-transfer theorem for an N-port network with prescribed 
internal sources 

We consider a linear, passive, time-invariant electric network with prescribed 
internal sources and N accessible ports (N > 1). The network is operating in 
the sinusoidal steady state. The equations corresponding to the Thévenin repre- 
sentation of the network are given by б 


N 
254 І, + Vin = En (т=1,..., N), (A.1) 
n=1 
where {7,} are the currents flowing out of the ports, {Vin} are the voltages 
across the ports, Z,,,, defines the internal-impedance matrix and {E,,} are the 
equivalent electromotive forces of the internal sources. The time-averaged power 
Р" dissipated in an N-port load is given by 


N 
РЕ = } | E Vm a (A.2) 
m=1 


We now define the “optimum state” of the network as the one that maximizes 
P^, In the optimum state, let {V,,°°'} be the voltages, {J,°°'} the currents and 
Рі! the power dissipated in the load. Then 


N 

PEopt — 4 Re| È Ро?“ d (A.3) 
т=1 

Let us now consider arbitrary (not necessarily small) variations *) {ôV m} and 

{ôl} around. {V,,°°'} and {7,°P'}, respectively, and take 


Vin = Vn + 6V_ (m=1,..., N) (A.4) 
and 

I, = Ino? + 81, (n —1,..., №). (A.5) 
*) While preparing the manuscript, the author's attention was called to a paper by Desoer 16), 


where a similar line of reasoning is followed and where a more sophisticated treatment is 
given. 
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к 


While varying the voltages and the currents, we keep {Z,,,,} and {En} fixed. 
Consequently, {6V,,} and {6/,} are interrelated through | 


N 
У Zmnn 81, + OV, = 0 (m=1,..., N), (A.6) 
n=1 


as follows from (A.1). Substitution of (A.3), (A.4) and (A.5) in (A.2) yields 


N 
PE = phot 4.4 Ве | Y (OV m Lno?* 4- V, 8L, + OV m ate) | (A.7) 


m-i 
Since 
+ Re (0V,,, In? *) = + Re (дУ„* LP), (A.8) 
the first-order terms on the right-hand side of (A.7) cancel if 
N 
Re| У; (6V,,* Ir Kot 22] = 0. (А.9) 
т=1 y 


On account of (А.б), this can be rewritten as 


N N 
Re| Y (- Y Zam L9 + v") л | m (A.10) 


1—1 п= 1 


Equation (A.10) holds for arbitrary complex (07,*), provided that 


N 
y,” = У Z, m* 1,99% (m=1,..., N). (A.11) 


n=1 


Introducing the impedance matrix of the load through 


N 
Vn = У Zm Ig (m=1,..., N), (A.12) 


n=1 
eq. (A.11) implies that in the optimum state we have 
Zinn’ = М" (m,n=1,..., N). (А.13) 


Equation (A.13) states that the optimum N-port load has an impedance matrix 
which is the Hermitean conjugate of the internal-impedance matrix of the net- 
work. It now remains to be shown that the condition (A.9) maximizes Р". To 
this aim we observe that if, apart from the sources, the given N-port network 
is passive, we have 


N N 
i ref > Ms Zm,n n | 20 (A.14) 


т=1 п=1 
for any sequence {6/,}. Using (А.б), (A.14) and (A.9) in (A.7) we arrive at 
PE < pron (A.15) 
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for any sequence {d/,}. This result completes the proof. If the given N-port 
network is dissipative, the left-hand side of (A.14) is positive for any non- 
identically vanishing sequence {6/,} and hence PL < Ріо! for any non- 
identically vanishing sequence {6/,}. i 
The time-averaged power P, that is dissipated internally in the network is 


given by 
N 


; N 
Р, = zz E NEZ 1 


т=1 п=1 


N N 
= 4 Ке | ISO ME. 2 (A.16) 


m=1 n=1 


In the optimum state this reduces, on account of (A.13), to 
Рут! = PEM, (A.17) 


The time-averaged power P, that is delivered by the sources is given by 


N 
Р, = ref "E (A.18) 
т=1 


Using (A.1), (A.2) and (A.16), this becomes 
P, = P, + PE. (A.19) 
In the optimum state we therefore have, by virtue of (A.17) and (A.19) 
pt Phopt — j ppt : (A20) 
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Abstract 


The resistance of a bulk resistor exhibits random fluctuations as a func- 
tion of time. A relation is derived between these fluctuations and the 
second-order statistical properties of the conductivity of the bulk mate- 
rial. 


1. Introduction 


A number of electrical noise phenomena can be adequately modelled by a 
piece of electrically conducting material exhibiting randomly fluctuating con- 
ductivity and provided with two ideally conducting electrodes. Apart from the 
random fluctuations, the material is assumed to be homogeneous, i.e. the time- 
averaged value of conductivity is constant throughout the occupied volume. If 
the configuration is excited by a direct current J) at the terminals, a direct volt- 
age will occur across the terminals at T = 0 (zero temperature), while a (small) 
noise voltage will be superimposed for T > 0. 

It is the purpose of this paper to derive a relation between the root-mean- 
square value of the noise voltage of the resistor considered above and the 
second-order statistical properties of the conductivity of the bulk material. It 
will be found that a noise voltage occurs only if there is a certain correlation 
between the conductivity fluctuations at neighbouring points. The theory is 
developed from a purely macroscopic point of view without any reference to 
the granularity of the conductance mechanism. 

Problems of this type arise in the study of 1/f noise 12,3). In this connection, 
the author is indebted to his colleagues F. N. Hooge and L. K. J. Vandamme 
for drawing his attention to the present subject matter and for formulating the 
problem discussed above. 

The problem is solved in two steps. First, the (small) space-time variations of 
conductivity are assumed to be described by deterministic functions. In the 
second step, the stochastic nature of the fluctuations is taken into account. 
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2. The deterministic problem 


In accordance with fig. 1, we assume that two ideally conducting electrodes 
1 and 2 with surfaces A, and A, are embedded in a material (volume V) with 
conductivity o(x, у, 2, t). The direct current J, enters the configuration at elec- 
trode no. 1 and leaves it at electrode no. 2. The quantity to be determined is the 
voltage V,,(t) between the two electrodes. 

For the sake of simplicity, the bulk material is assumed to extend to infinity. 
Only minor modifications are required in our treatment, if finite dimensions of 
the configuration have to be taken into account (cf. sec. 4) or when one elec- 
trode is assumed to enclose the other (e.g. in the form of two concentric spheres). 

The fluctuations of o(x, y, z, t) are assumed to vary so slowly in time that a 
quasi-stationary approach becomes applicable in which both displacement cur- 
rents and eddy currents are neglected. This implies that the system under consid- 
eration is memoryless, i.e. the terminal voltage V,, at time to is determined by 
о(х, y, 2, to), uninfluenced by the past history of the conductivity. Hence, omis- 
sion of the parameter ¢ in the various time-dependent quantities о, V, ,, etc. is 
justified in the following analysis. 

At a later stage of the analysis, the conductivity function o(x, y, 2) will be 
assumed to consist of a constant value and small superimposed variations. This 
assumption of weakly varying conductivity allows a first-order perturbational 
analysis the results of which can be cast in closed form. 

With J denoting the vector of current density and @ the electric potential as 
functions of the position (x, у, 2) the current-potential distribution in the 
conductor is governed by the basic differential equations 


div J — 0, (1) 
J = —о grad g. (2) 


These equations have to be solved under the boundary conditions (cf. fig. 1 for 
the direction of the normal vector n) 


I = fJ.nd4 =— fJ.ndA4, Фф = constant on A, and 4,. (3) 
А2 


А1 


Fig. 1. Two-electrode configuration and associated flow lines of electric current density. 
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Finally, the unknown voltage V,, is associated with y according to 


Vi; = ф(А,)— 9(42), (4) 


where g(4,) and ф(А„) denote the potentials on the equipotential surfaces 4, 
and A,, respectively. 

After insertion of (2) in (1) and (3), we obtain a well-known problem of 
potential theory with an infinite number of solutions ф(х, y, z) differing by 
additive constants. For the determination of V,, as the difference of two 
potentials, all solutions are equivalent, but for the sake of convenience we will 
make the potential unique by the additional requirement 


9 — 0 at infinity. (5) 


Let us assume that the foregoing potential problem has been solved 
for the case of a constant conductivity о(х, у, 2) = со with the solution 
ф(х, y, 2) = qox, у, z) and I(x, y, 2) = Jo(x, y, z). Notice that, in general, фо 
and J, depend upon position, whereas со is constant. We will now assume that 
с deviates from со by a small amount óc giving rise to slightly deviating poten- 
tials and current densities as follows: 


o(x, Ys 2) = O09 + do(x, У, 2), 
ф(х, У 2) = Pox, Y, 2) + бф(х, ys 2), (6) 
J(x, y, 2) = Jo(x, у, Z) + ôJ(x, y, 2). 


Both the triplet c, p, J and the triplet со, Po» Jo satisfy (1), (2), (3) and (5) with 
the same current J, in either situation. Subtraction of corresponding equations 
ylelds the relations 


div 6J = 0, (1) 

ôJ = — со grad дф — do grad фо — дс grad dg, (2 

0 = f ôJ.ndA —— fóJ.ndA, бе = constant on 4, and 4,, (3’) 
A2 Ai 

бф — 0 at infinity. . (5) 


If, by way of a first approximation, the second-order term де grad бф is ne- 
glected in (2^), one obtains, after insertion of (2’) in (1^, the Poisson equation 


def дс 
A(óg) = div grad дф = — grad (=) . grad фе, (7) 


To 


which has to be solved under the boundary conditions (3’) and (5^). 
If one is explicitly interested in the first-order perturbation field бф, (7) has, ` 
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indeed, to be integrated *). However, since we only wish to determine the 
terminal voltage V,,, we need not adopt that procedure. In addition, it happens 
that the local condition “|до grad 2 is small” сап be replaced in our case by a 
weaker integral condition. 

To this end, we consider the vector identity 


div (фо 6J — бф Jo) = grad фо. д3 — grad ôg . Jg = 
= — óc [grad Pol? — до grad бф. grad Po, (8) 


where use has been made of (1), (1’) and (2’). This relation is now integrated 
over the region V occupied by the material under consideration. With Gauss' 
theorem the left-hand side is converted into surface integrals over the two 
electrodes (owing to the asymptotic behaviour of фе, бф, Jo, 6J the surface at 
infinity does not yield a contribution): 

f[(999J— dp Jo). ndA + f (p; ôJ— 0p Jj) .ndA = 

A1 Аз 


= [(— дс |grad Pol? — óc grad ôọ . grad фе) dv. (9) 
LA 


In the integrals of the left-hand side, фо and бф are constant along the equi- 
potential surfaces A, and A, and may therefore be removed from the integrals. 
Furthermore, by virtue of (3, the remaining integrals of 6J vanish. With (3), 
which is also valid for J replaced by Jo, we thus arrive at 


0Vi, lo = — f бо (|grad Pol? + grad дф. grad фо) dv, 
y (10) 
with 
ду, = 0g(4,) — Óg(A;). 


If the second term in the integrand (which is small, of the second order), yields 
a contribution to the integral which is substantially smaller than that of the first 
term (a weaker condition than the above-mentioned local one), we may 
approximate (10) by 


ду, lo я#— f óc [grad gol dv, (11) 
V 


which yields the voltage deviation 6V,, when óo(x, у, z) and (x, у, 2) are 
known functions of position. 

Proceeding along similar lines, we can also determine the “unperturbed” 
voltage (V;2)9 from the (exact) relation 


(Y:3)o lo = f oo [grad Pol? dv (12) 
v 


which can immediately be interpreted in terms of dissipated energy. 


*) Such an integration is also necessary if a four-terminal configuration is considered. A special 
case of such a configuration will be investigated in a forthcoming paper. 
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3. The stochastic problem 


In accordance with previous considerations, (11) is valid at each instant of 
time; hence, after division by J), the more detailed formulation of (11) reads as 


1 
6V,(t) =— ГА fff до(х, у, z, t) lgrad qox, у, z)? dxdydz. (13) 
0 V 


We now wish to determine <(6V,,)*), the time average of [6V,,(t)]*, due to 
random fluctuations of óo(x, y, z, t). To this end we introduce the “correlation 
function" 


def 
Rpo = «до(хь, JP 2р, t) бо(хо» Уо» ZQ» t)> (14) 


which allows us to write ((6V,,)*) in the form 


1 
{(6V12)?) = — Jf Кьо |grad Фо(хь, Y; zp)|? |grad Фо(хо, Ya: Ze) |? dvp dvo. 
I? уу 
(15) 


With the aid of (15), the root-mean-square value of the noise voltage 
(Vaoise)ams = (У, 2)?! /? can be evaluated from a given correlation function 
of the conductivity fluctuations and a given unperturbed potential distribution. 

In most practical cases, (15) can be further simplified by the assumption that 
the “correlation length" r, is small compared with all characteristic dimensions 
of the configuration. More specifically, it is assumed that |grad Pol? does not 
exhibit substantial variations owing to spatial displacements of the order of r,. 
Concerning the evaluation of (15), this means that the main contributions to the 
integral originate from pairs of closely neighbouring points P, Q, which allows 
the simplification 


1 
9/2) = 77 S MP) grad фобхь, ур, 20) dv» 
Qo V 


with (16) 
M(P) = f Reo dvo. 
y 


Furthermore, if the medium is homogeneous with respect to the statistical noise 
properties (“spatial stationarity”), M(P) = Mo becomes a constant which may 
be removed from the integral: 


Mo Я 
(6,2)? = 7 f |grad фо de. (17) 
0 V 


This confirms previous semi-empirical results found by Honig 3). 
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4, Example 


The following simple example illustrates the final result (17). With reference 
to fig. 2, a cube of side length B is provided with two parallel plane electrodes. 


Fig. 2. Cubic configuration containing noisy material. 


Here the material does not extend to infinity, but it can be easily seen that (17) 
holds also in this case (the “boundary conditions” at infinity have then to be 
replaced Бу Jo.n = 0 and ôJ . n = Oat the open surface of the material). Further- 
more we assume that Rpg is constant inside a small cube of side length b centred 
at P and zero outside the cube *). Considering that 


Rpr = «[до(хь, ур, ze?) = ((60)?) 
we obtain from (16) 
My = f Reg dvo = b? «(до)?). 
Using (12) and the fact that |grad фе] is constant in the present configuration, 
(17) yields 
OV 2)? b N? (до)? 
« P B ( ) <(60)? | (18) 
(Yi3)o 


2 
In general, we can conclude that, due to b/B « 1, the relative fluctuations of the 
terminal voltage are small compared to those of the conductivity of the mate- 
rial. 


B Oo 
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Given N cubes, it is possible to construct an animal by putting the cubes 
together face to face. An animal of this kind, constructed from N cubes, is 
called an N-omino. There are two- and three-dimensional N-ominoes. Two 
N-ominoes are said to be different if they cannot be made identical by turning 
them over and around. The question which now arises is how many different 
N-ominoes it is possible to construct. 

A computer program was written to find the number of N-ominoes for some N. 
Table I lists the results for N = 3 to N = 8. Table II is a schematic represen- 
tation of the N-ominoes for N = 3 to N = 6. The cubes of an N-omino are 
indicated by a square. In the three-dimensional case an N-omino is divided 
into a bottom layer and a top layer. Free positions in a layer are indicated 
by a dot. 


TABLE 1 


The number of N-ominoes for N = 3 to N = 8 


number of N-ominoes 


two- three- 
dimensional dimensional 
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TABLE II 


Schematic representation of the N-ominoes. The cubes are indicated by the 
symbol o, the free positions by the symbol - 


3-OMINOES 


1 una 2 un 
п. 


4-OMINOES 
1 папа 2 пап 3 nna 
o- 0 
4 па 5 ап 6 пп a 
oo па a 
7 a na 8 a n 
Qo б. oa 


5-OMINOES 
i oQOaqog 2 0000 з ооов 
Bs nee 
4 nun 5 ооо 6 naa 
nna па п.п 
7 ооо 8 a: 9 a 
(m · ооо ооо 
п. o.. -o- 
10 nu ll o.e.: 12 :n- 
od nua пап 
оо п Н 
13 апп п... 14 n-- nou 15 0-- о. · 
Dee NT п... "na" ooo 
16 опоо Ги ЕЕ 17 oa: а. · 18 nana а - · 
. „Б LL *DJgJ3 LEE EE i e. «rg 


19 п... uoa 20 n-- пп. 21 п... gua 
.». . o 0- MES ME "Ba К И Q 
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TABLE II (continued) 
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22 апп 23 oa: ‘oa 24 пы. о · 
D. ' B . <00 . æ 
25 опо 26 oa: бв. 27 па. D 
+ + 8 + е + оо “о • пп 
28 пп 29 a: oo 
а. па . 
6-OMINOES 
1 aaonuaod 2 00000 3 оооп 
п а СЕ 
4 ngo-:-- 5 00000 6 ооо. · 
-0000 ‘a: воо 
7 аппа 8 оороо 9 ово: 
па... п.п п. оо 
10 оооо 11 ooao 12 опоо 
о. 0 пп nung 
I3 апа 14 nuaoug 153 a 2= 
non Dee pogo 
а + = Ж А ° 
16 o: 17: = = 18 o... 
20000 оо · · DBon 
"*D*-* . 000 е0 
19 о: 20 o: 21 000 
ooo: oooo Dec 
па беп Be 
22 ап... 23 oo:: 24 но · · 
‘ooo п... пип 
ж ы» = ооо *DBD* 
25 nu-- 26 апп 27 -n-- 
‘oa: ‘ogo oooo 
oo o- e[-* - 
28 -0-- 29 nuu 30 во. 
папап oo nna 
п> Es оО • * 


oooo0 


o000 


nau 
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TABLE II (continued) 


32 


35 


38 


41 


44 


47 


50 


53 


56 


62 


65 


68 


71 


74 


а о 


-000 


оов 


пое 


ooo 


33 


36 


39 


42 


45 


48 


51 


54 


57 


60 


63 


66 


69 


72 


75 


o-oo 
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TABLE II (continued) 


76 ae: ооо T o:> 0-- 78 
[а] беш ооо ‘oO: 
79 п... (m • · 80 o-oo DO.‘ 81 
пап беп пап A 
82 ор: ооо 83 ао па 84 
*D-* DEP * D + + 
85 ооо пп. 86 вап. пае 87 
+0 + + + 8 o п, 
88 na gag 89 DD (m · 90 
. o9 * : O о · “о 
91 пп. (m 92 пп. D-:- 93 
* BD *D- *D-* *DUJ 
94 о. · ooo 95 о. · oo: 96 
*D- *D- п. -oo 
97 в. · ап. 98 пп = 99 
* "D “оо “о · ‘O > 
100 пп = 5 101 oo: ‘a: 102 
бап ‘oO: ‘oO: са 
103 ooo "D? 104 ор єп. 105 
iv пп. 201€ a 
106 oO D- 107 п. Doa 108 
а. оо qa п 
109 o. апа 110 в. · п. 111 
о · . ». >œ пп . 
о · oe. 
112 o»: D- 113 о. · о · 114 
nag : пп. NE 
“oe * 7g 
115 пап а · 116 nna в. 117 
. . > о · *D-* soe 
Oe: *D- 


ооа 


ово 


gno: 
пп 
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TABLE II (continued) 


о · · 119 оо: а. 120 поп п, 
Улы us -Bn CVV - «D 
= D es ee * 0 
122 can imos 123 coo - ‘oOo: 
n n-- o- а б 
п. а 
‘oo 125 п. 126 п... G 
e.. naa n п. ooa 
п. о · . 
128 п... ttt 120 n-- 
n oaa п... пп a 
а п. . аа 
131 п... cee 132 o.. . 
D:- п. ппп о. · пп. 
VEA "e n rw os . 00 
65 134 о. · 135 п... 
ооо ооо [n] un na 
* D EJ . п. 
137 о. · EE. 138 0. · 
а апа ёа. oo oo 
D tos o 
140 р. · nce ie 14] п... 
а ooa єп. ooo o 
a 0 
143 о. · sse cs 144 а. · 
ап па. єп. oo па 
(m па n 
146 о. · sos 147 o.. 
nan D oo 
о з. сап a 
149 пп. па 150 ап. *n- 
o 0 · aie oo 
* D о · D- 


п. 152 пп. єп. 153 пп. ‘a: 
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TABLE II (continued) 


154 пп. sos 155 во: ett 156 пп. 
‘oa п. п. 0n п п 
п п. ап 
157 оро ла; 158 пп. oes 159 ов · 
ап бп. єп. аа -^ єп. oo 
E o *D- n 
160 пп. б 161 ов. e. 162 -n0- 
na t oo * n ooo п 
o а п. . п 
163 -a: e. 164 -o:> AE 165 -o> 
апп єп. пп. ‘oa oo: oo 
n єп. o 
166 


n 

B 

[я] 
воро 
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1. Introduction 


In my thesis") a method is described how to generate so-called c-nets 
automatically. I used a code of the network from which a planar representation 
can immediately be found; this code defines the net uniquely. From this code 
one can easily obtain the mesh-mesh incidence matrix INC automatically. 

By removing one row and its corresponding column one obtains a matrix ¢ 
that plays an important role in the determination of the currents in the net- 
work when an accumulator is placed in one of the branches assuming resistors 
of 1 Q in all of the branches. For the determination of the currents one needs 
the inverse of ¢. In my thesis this inverse was obtained by means of Gaussian 
elimination using integer arithmetic. Each element in the calculation was de- 
noted by a pair of integers (although one common denominator per row). 
In order to avoid extensive growth of the integers it was necessary to determine 
highest common divisors that could be removed from the occurring integers. 
Obviously the determination of the inverse of ¢ is rather time-consuming. In 
this paper a new method is presented that i üs much faster than the above- 
described method. To achieve this I use the fact that a network N, with a 
corresponding matrix ¢y, can be obtained from a network Nọ having one 
branch less than Nj. 

If ¢y,~* happens to be known one can obtain бп, ^ without completely 
inverting the matrix. 


2. Outline of the method 


In order to explain the method I use a reference network R and its dual Ё 
as an example. It is shown in fig. 1. The reference net R has the following 
characteristics : 


number of nodes K —9, 
number of meshes M=8, 
number of branches В = 15, 


complexity С = 1600. 
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Original Dual 
Fig. 1. Reference network R and its dual Ё. 
A code of the original network R is 
17210273203783038943049540598765016710123456100. 
A code of its dual Ё is | 
1781018210284320485405865068760123671034630456400. 


For definitions of the code and the complexity I refer to ref. 1. 

From the code one can obtain the mesh-mesh incidence matrix ІМС. 
The matrix £4 has been obtained from INC; by removing row 8 and column 8. 

The complexity of R is denoted by Cp. The matrices ІМС, ёк and Ср бк! 
are given in fig. 2. 

We now assume that we want to construct a network N of order 16 with 
K — 10 and M — 8 by adding branch 13 to the dual network and dualizing 
back. Note that INC, differs from INC, in four corresponding elements 
namely: f 

ING,[1, 3], INCy[3, 1], INCy[1, 1] and ING,[3, 3]. 


First of all we change the reference system of independent meshes. In the 
reference net R we have used the meshes 1, 2, 3, 4, 5, 6 and 7. We remove 
either 1 or 3. Let us remove mesh 3 and introduce mesh 8. The matrix with 
independent meshes 1, 2, 8, 4, 5, 6, 7 is denoted by Ск“. It can be shown that 
к = Гер I", where Г is a square transformation matrix consisting of 
elements that only take the value 1, 0 and —1 °). 
I" is the transpose of Г. Hence it follows (£4*)^* = (Г!) p71 T+. The 
inverse (ёк) : can therefore be obtained from Ск! by the following formulae: 


ды = ру + ри — Ри — Ра izüjzt 
Vn = Pu — Pii ixt 


со —-10 tA PWN н 


172 92 
300 750 350 150 
172 350 878 358 
92 150 358 638 
72 100 228 308 
124 150 326 286 
166 126 


Ср lt 
Fig. 2. INCr, £g and Cr £47! for the reference net R. 


and Qu = Pu 


where is the mesh that has been removed from Ср and p;; = element of £47! 
and qı; = element of (£4")7!. In our example г = 3. Using these relations we 


obtain the following result: 


656 928 528 
706 528 878 
440 320 520 


Cr (E)! = 


М с олњ о м н 


512 712 


The matrices £&' and ;* only differ in one element, namely the one with in- 


dices 1,1. 


706 440 


550 400 650 600 1150 600 600 
504 352 552 480 600 768 608 
480 600 


550 824 
400 352 512 
650 552 712 
600 480 480 


608 1248 
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| 7 2 8 4 


7| 1248 512 712 480 

2| 512 928 528 320 400 352 
8| 712 528 878 520 650 552 
4| 408 820 520 800 600 480 
5 
6 
1 


600 400 650 600 1150 600 
608 352 552 480 600 768 


824 656 706 440 550 504 | 1262 


Cg (£87)! 
Fig. 3. 


When we finally interchange meshes 1 and 7, we arrive at the results shown 
in fig. 3. The matrices are denoted by Ср", Cy" and Cg (C4). 
The matrix (ёк )7® is partitioned in the following way: 


Cr (Eg)! = lë ‚ where £4 = 1262, 
£5 | £3 
824 
656 
& = e and £;' is the transpose of £z. 
550 
504 


The complexity Cy of the new network equals the determinant of бу. It . 
can therefore be obtained from Cy = Cr + £4 = 1600 + 1262 = 2862. The 
new inverse (£y ")^! can be obtained from the following formula as can be 
verified easily: 
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жау] __ (2, Cy = & &,")/Cp £; 
d dE [69 GSES | 


Apparently £, Cy — & £;' must be divisible by Cp. The result is 


1808 578 910 632 
578 1391 655 392 490 423 656 
910 655 1259 736 920 765 706 
632 392 736 1310 922 720 440 
790 .490 920 922 1868 900 550 

828 423 765 720 900 1215 504 

824 656 706 440 550 504 1262 


Cy (b)? = 


— ON tA 4 бо о м 


Finally we interchange the meshes 1 and 7 and as a last step we remove mesh 8 
and introduce mesh 3 again. We then have completed our task. The result is 


1109 554 553 257 183 292 467 
554 1340 604 260 174 262 272 
553 604 1259 523 339 494 349 
257 260 523 1097 525 478 245 
183 174 339 525 1287 474 219 
292 262 494 478 474 944 412 
467 272 349 245 219 412 1247 


Суб = 


- Ota Ut мо — 


Sofar we have only added branches in the dual network. We also want to 
add branches in the original network. One could think of dualizing the net- 
work but I could not easily obtain the corresponding inverse matrix of the 

` dual network from the original network. 

Therefore I tried to derive the inverse of the new network from the inverse 
of the original network only. 


3. Adding branches in the original network 


When we add a branch in the original network it means that we split a mesh. 
When for example a branch 25 is added, mesh 8 is split into meshes 8 and 9. 
Note that mesh 8 is not one of the independent meshes in Ср. In case we split 
an independent mesh we have to change the reference system by the method 
described in sec. 2 such that this mesh is not one of the independent meshes. 
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Let us denote the new network by N;. We then obtain the following mesh- 
mesh incidence matrix INCy, : 


Cnn. A nA I» WN = 


© 


1 
2 
3 
4 
5 
6 
7 
8 


The matrix бм, is partitioned as follows: 


A 
ty, = EL |4: | , where A,’ = transpose of A,, 
2 A, | 4, 


А, = 0 |, 4, = 4 and £4 has the same meaning as in sec. 2. 


INVERSE MATRICES IN SQUARED-RECTANGLE CALCULATION 335* 


If we put £g = Cpr Ср! and ёр А, = B, we can easily show that 


(Cuz ér + B, By')/Cp| —B ; 
Cy, ny EE [ев Bie , with Cy; = Cg Az — Ay’ Er Ay. 


In our example we have: 


| —1136 
— 600 
— 664 
Cr A, =| — 504 |, 
— 464 
— 888 
—1208 


while A,‘ £& А, = 3232 and Cy, = 4 x 1600 — 3232 = 3168. 


Finally we find 
2248 1020 812 540 472 876 1420 1136 
2! 1020 1710 942 486 372 630 750 600 
3| 812 942 2014 918 644 1014 830 664 
Cy, na = 540 486 918 1422 756 846 630 504 


4 
5| 472 372 644 756 1576 804 580 464 
6 876 630 1014 846 804 1566 1110 888 
7| 1420 750 830 630 580 1110 2302 1208 
8 


1136 600 664 504 464 888 1208 1600 


4, Significance of the method 


In 1962 we calculated all c-nets of orders up to and including 19 on the 
PASCAL and STEVIN computers of Philips’ computing centre. The c-nets 
were made available on punched cards and punched paper tape. Furthermore 
all nets of order 20 were generated but kept in the computer. We investigated 
whether these nets produced any perfect squared-square of order 19. We found 
only a large number of imperfect squared-squares of order 19. In 1968 and 1969 
IBM Netherlands gave me the opportunity to use the IBM 7094 at Rijswijk 
to tackle this problem. 

D. Severein wrote programs in 7094 assembly code according to the method 
described in my thesis. As a result the nets of orders up to and including 20 are 
available now on magnetic tape. 
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We found that the number of c-nets of order 20 is less than or equal to 18953. | 
The following approach is now feasible. First we calculate for each net N of ` 
order 20 the matrix £47! by means of Gaussian elimination. Next we generate 
the nets of order 21, 22, 23 and 24 and search for existence of possible squared- . 
square solutions. To achieve this we investigate бу ! directly for the orders 21, 22 
and 23. For the nets of order 24 we only calculate the complexity and remember 
that squared-square solutions can only be obtained from a net with a com- 
plexity of the form 2kA?, where k and А are integers. ` 

The nets of order 21 will not be generated in one run. We rather generate the 
complete tree of nets and search this tree. By doing this we only need to store 
four nets with inverse matrices. Obviously we will not store the nets on magnetic 
tape. 
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Consider a catalogue S which lists one to infinitely many shapes of rectangu- 
lar bricks with positive integer dimensions. Using as many bricks of each 
shape as needed, the bricks listed in S may be used to completely fill certain 
rectangular boxes. We assume the shapes to be oriented, i.e. we are not allowed 
to turn bricks around when trying to fill a box. Thus, a new catalogue T(S) 
may be formed which lists the (infinitely many) rectangular boxes which may 
be completely filled with bricks having their shape listed in S. Some of the 
bricks listed in S may be shapes of boxes which can be filled up completely with 
smaller bricks listed in S; in other words, there may be elements s є S such that 
s € I(S\{s}). The bricks which may be formed with bricks in S smaller than 
themselves are composites. Bricks in S which are not composites are primes in S. 
If Br = B,(S) is the set of primes in S, then B, is non-empty and every box 
which can be formed with elements of S can be formed with elements of the 
subset Вг of S; in other words, Г(Вг) = I'(S) (see lemma 4). The subject of this 
note is the remarkable fact that the set of primes B,(S) is finite for every set S. 

The brief history of this problem is as follows: Results involving the tiling of 
rectangles and three-dimensional boxes with identical polyominoes and poly- 
_ cubes are discussed in ref. 3. One sort of result presented there is typified by the 
following example. The L-tetromino and two of the smallest rectangles it tiles 
are shown in fig. 1. Ana xb rectangle can be tiled with copies of the L-tetromino 


к=н 


Fig. 1. The L-tetromino and the smallest rectangles it tiles. 
*) Dedictated to our mutual friend C.J. Bouwkamp. 
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if and only if a and b are integers greater than 1 such that 8 divides ab. Also, 
every a xb rectangle having integer sides a and b greater than 1 with ab a 
multiple of 8 can be tiled with 2x4 and 3 x8 rectangles. 

If S denotes the set of rectangles which can be tiled with L-tetrominoes, one 
way to characterize S is to list the elements of Br(S) which are (2, 4), (3, 8), (4, 2), 
(8, 3); the first two are shown in fig. 1. On reading ref. 3 and doing some inves- 
tigation of his own, Frits Góbel noted that in every case where one has a char- 
acterization of the set S of rectangles which can be tiled with copies of one 
polyomino, a characterization of S can be given by listing the elements of Вг(5) 
because this always turned out to be a finite set. Thus, he was led to conjecture 
that B;(S) is finite for every set S of rectangular k-dimensional boxes. Bouw- 
kamp (see ref. 1) has done considerable work with a computer to determine the 
set of prime boxes among the boxes which can be tiled with the Y-pentacube. 

Frits Góbel described his conjecture to the second author of the present note 
who found a proof for the one- and two-dimensional instances of the conjecture. 
This proof was generalized in collaboration ?), but the proof breaks down in 
three and higher dimensions (the mistake in the proof is in the bottom third of 
p. 467 in ref. 2). The proof is illustrated for 2 dimensions, and this correct 
instance evidently convinced readers of the full generality. The mistake was 
finally noted and corrected in ref. 4. On reading this correction the first author 
of the present note thought of a simpler proof. Both proofs are presented here. 
The first proof is a corrected version of the proof which appears in ref. 2. 


Let (d;, . . ., dj) denote a k-tuple of positive integers. Such a k-tuple will also 
be called a shape, occasionally written as d. Then an (oriented) k-dimensional 
box with shape (d,,..., d,) is a set K of d, . . . d, integer points containing one 
point (the smallest) (c,, . . ., Cx) such that 


К = ((e + xs... Ck +) 0 Se, K<d—1G=1,..., bE}. 


The set of all k-dimensional boxes with shape (dı, ..., d,) is denoted 
(dis ..., dy». Boxes with shapes d;, da, ... are said to tile a box C if there 
exists a partition (C4, Cz, ...} of C with C,, С,,...є (dip U (d) Uu .... 

If S is a set of shapes then T(S) is the set of shapes of boxes that can be tiled 
by S. 

We shall also consider a more special kind of tiling (which we might refer to 
as repeated one-dimensional concatenation). If 2, b, č are shapes, and if there 
is an (1 <i<k) such that с; = а; -+ b,, and such that c; = ау = b; for all 
jÆ i, then we say that с is obtained from d and b by one-dimensional con- 
catenation. 

If S is a set of shapes, we define A(S) as the smallest set of shapes closed under 
one-dimensional concatenation, i.e. A(S) is the smallest set with the properties 


=. 


а 
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@ A(S) 2 S; 
(ii) if à e A(S), b e A(S), and с is obtained from à and b by one-dimensional 
concatenation, then сє A(S). 

For example, if d, Ё, ё are shapes, à e S, b e S, if c, = a, + b, and if c, is 
the least common multiple of a, and b; for 2 <j < k, then бє A(S). 

It is easy to see that if de A(S) then S tiles every box of shape 4. In other 
words A(S) = IS). The following example sliows that A(S) can be smaller 
than T(S): If S = (а, 4), (4, 1), (3, 3)} then (5, 5) e I'(S) but (5, 5) ¢ A(S). 

An element Ё of S is called prime with respect to A and S if b ¢ A(S \{b}). The 
set of all these primes is called B,(S) (or B, for short). 

The operator A has the following properties (for all sets S, Т): 
© Ss AG); 

(ii) if S € T then A(S) = A(T); 
(ш) ACA4(S)) = ACS); 
(iv) if à e A(S) then there exists a subset U of S with a e A(U) and such that 
every shape in U is < à (we say that b < aif b, < a for i = 1,..., k). 
From these properties we derive: 


Lemma 1.1ЁТ © S © A(T) then B,(S) = T. 

Proof. Let b be any element of B,(S). By the definition of B4(S) we have 
b ¢ A(S \ {b}. Now by (ii): b ¢ A(TY {b}. Since B,(S) € S = A(T) we have 
b e A(T), and it follows that A(T V (b)) = A(T), whence b e T. 


Lemma 2. A(B4(S)) = A(S). 

Proof. According to (i), (ii), (iii) and to B,(S) = 5 it suffices to show that 
S © A(B,(S)). We do this by showing that if S | A(B,(S)) contains a shape д 
then it contains a shape b with b < à, b + à (and that cannot go on for ever!). 
Let à be such a shape. Then à @ B,(S), whence à e A(S \ {ã}). By (iv) we 
can find Usuch that U € SY (à), āe A(U) and such that all shapes in U are < d. 
We cannot have U < A(B,(S)) since that would imply à e A(B,(S)). So we can 
take b е U such that b ¢ A(B,(S)). Since b e U = 5 \ {a} we have b < a, bes, 
b = à. Hence b e 5 \ A(B,(S)). 


Lemma 3. If T © S = I(T) then B(S) = T. 
Lemma 4. I(Bj(S)) = IS). 


Proofs of lemmas 3 and 4 are obtained from those of lemmas 1 and 2 by repla- 
cing all A’s by Г. 


We now express our main result as a theorem. 
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Theorem. Let S denote a set of k-dimensional shapes. Then we have 

(i) Shas a finite subset T such that S & I(T) (whence T(S) = Г(Т)); 
(ii) Br(S) is finite; 

(iii) S has a finite subset Т such that S © A(T) (whence A(S) = A(T); 
(iv) B,(S) is finite. 


Remark. It follows from the lemmas that (i) is equivalent to (ii), and that (iii) . 
_is equivalent to (iv). Furthermore (iii) implies (i) (since A(T) = Г(Т)), hence it 
suffices to prove either (iii) or (iv). 


First proof. (Corrected version of the proof given in ref. 2; it will prove the 
theorem in the form (iv).) 

The proof is by induction on the dimension k of the boxes. For k = 1, S may 
be regarded as a set of positive integers and B,(S) is the smallest subset of S 
such that every number in S is a non-negative linear combination of elements 
of B,(S). Suppose the greatest common divisor of elements of S is d; then it is 
easy to show that some finite subset F of S has greatest common divisor equal 
to d. Hence, since every large multiple of d is a non-negative linear combination 
of elements of F, all but a finite subset E of S is contained in A(F). Hence, 
S € A(FUE) and B,(S) = FUE. This proves B,(S) is finite in the l- 
dimensional case. 

Now suppose B,(S) is finite for every set S of k-dimensional shapes for 
К =1,...,n—1withn > 2. But, suppose there exists a set T of n-dimensional 
boxes with B,(T) infinite (this leads to a contradiction). Without loss of gen- 
erality it can be supposed that B,(T) = T. Also, let the elements of T be put in 
a sequential order (/,, £5, . . .) with й, = (ti1,..., tin) fori = 1, 2,... . Each of 
the п sequences (f; : 1 = 1, 2,...),..., (tint i= 1, 2, .. .) must tend to infinity. 
For example, if the first sequence (¢,,: i = 1, 2, .. .) does not tend to infinity, 
then there exists an infinite constant subsequence (f, : i e Г) so that the infinite 
subsequence (й: i e I) of (й: i = 1, 2, . . .) contains boxes all having the same 
height. It follows from the induction hypothesis that the set {7,: i € Г} has finite 
basis with respect to A. A moment's thought may be required here because a 
detail is being passed over. At bottom, we are using the fact that having boxes 
all of one height really amounts to dealing with boxes of one lower dimension. 

Now we select an infinite subsequence R of T and show that B,(R) is finite. 
This provides the contradiction we seek since this implies there exists an element 
(in fact, infinitely many elements) of T which is composite. (Supposing T = B,(T) 
means no element of T is composite.) 

Given n-tuples 2 = (a,,. ..,a,), Ё = (b,,. . ., bn) of positive integers, à is said 
to divide b if a, divides Б, for i = 1, . . ., n. This notion of division leads to a 
notion of greatest common divisor of two or more n-tuples of positive integers. 
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We form the subsequence R = (f, 72, ...) as follows: Let F, = £,, and note 
that 7, has only a finite number of divisors. Hence, 7, has the same greatest 
common divisor d, with each one of an infinite set of subsequent terms of T for 
some divisor d, of 7,. Let T, denote such an infinite subsequence of (5, £5, . . .), 
and let #, denote the first term of Т». Now this process is repeated with T, in 
place of T and F, in place of 7,, thus forming a new subsequence T; of Т» with F, 
deleted, and F, is defined to be the first term of T3. Thus, the ith term of R is 
found by i — 1 repetitions of this process. It follows from this construction that 
R has the peculiar property that, for each i, the greatest common divisor of F, 
with all subsequent terms of R is the same n-tuple d; for i = 1, 2, ... . Any 
sequence with this property is said to be stable. Note that every subsequence of 
a stable sequence is again stable. Now we show by construction that B,(R) is 
finite. 

Let 

6; = 4,1, fo j=l1,...,n 


where 
dj, = (don—1,1, mE dn 1,5); 


then we shall show there exists an integer p such that A ({7,, . . ., 7,4) contains 
all boxes having shape (p, Ôi, . . ., p, Ôn) with pi, . . ., p, > p. This implies 
B (R) is finite since all but a finite subset of the elements of R have this form. 
Let (s) denote the least common multiple of ғ, , .. ., rs; for i s = 1,...,п 
where P, = (ri, ..., Fin) fori = 1, 2,... . Now we show by induction on j: 

(4). For every stable sequence there exists a number p; such that every shape 


(0101, - + +5 Gy 0, uj... ГА)! (1) 
with 91, .. .,9 Z p; is an element of A({F,, . . ., #2). 
Forj = 1, x,y = 1, 2,..., the shape 


(rii x + F21 У, ux), LE) ш(1)) 


тау be formed from F, and F, by repeated one-dimensional concatenation. But, 
there exists an integer p, such that q, д, has the formr,, x + r;, y with x, y > 1 
for all q, > p, because the greatest common divisor of r,, and r,, divides ó,. 
This proves А,. 

Now suppose A, is true for some j > 1; we shall prove 4, ү. Let и, (5) denote 
the least common multiple of r,5,,,,, . . ., 755«1,,, and note that A, also applies 
to the stable sequence (7,: t = 27 + 1, . . ., 2/++), Thus, there exists a number 
р; such that every shape 


(di 61, zm б, k+ (J), 699 Un (JY) (2) 


where 41, .. ., d; > py is an element of Л({#»у+1,.. ., 723+1}). Now we apply 
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repeated one-dimensional concatenation to shapes given in (1) and (2) to obtain 
the shape 


41905 «54105 X ha) + Y aa О)» Kira” O) + + n G)) G) 


for all q,,...,q; 2 max (py pj), x; y = 1,2,... where yw,” denotes the least 
common multiple of u, and u,'. Now observe that the greatest common divisor 
of 4;+1(j) and ui, .'(j) divides 0..1, so there exists an integer 


руъі > max (py py) 


such that every number q;,, 04, with q;,, > pj+1 has the form 


X js) + Y ui (Л) 


with x, y > 1. Also, note that 4,(j) and y,'(j) have the least common multiple 
(j + 1) by definition for j = 1, . . ., n. Hence A, implies 4,,,. This completes 
the proof. 


Second proof. We shall prove (iii); the idea is modelled after the following proof 
(slightly longer than necessary) for the case k — 1. Let S be a set of positive 
integers. If S is empty, nothing has to be proved. If S is not empty, choose an 
he S. For any r (0 <r < h) with the property that A(S) contains a number = r 
(mod л) we select such a number. The set of selected numbers is denoted by D. 
Let q be some positive upper bound for the elements of this finite set D. We 
now have A(S) = A({h} UD U {1,...,9— 1}) since every element of S exceed- 
ing g — 1 is the sum of an element of D and a non-negative multiple of Л. 

We next proceed by induction with respect to the dimension k. We take 
n > 1 and assume the theorem correct for k = n — 1. One of the dimensions is 
singled out and referred to as “height”. If а = (a, . . ., а„) is an n-dimensional 
shape we write (2,, . . ., 4,.,) = &* (the “cross-section”) and a, = height (о). 
Furthermore, if t is a positive integer we write a** = (2,,.. ., Q,—1, t). 

As shapes of constant height can be treated as shapes of lower dimension, we 
note that if а, 04, . . ., а аге such that a* є A({a,*, . . ., «,*}) then 


a*t e Alo; г): 


Let S be a set of n-dimensional shapes. We put Y = {a*: х e S}. By the 
induction hypothesis Y has a finite subset Z with Y © A(Z). Take &;,..., &pE S 
such that Z = {a,*,..., a,*}. We put 


p 
Х = {«,,...,} A= [| height (aj). 


i=1 
Since / is a multiple of height (о), we have «,*" e A({«}). If « e S then a* є Y, 
whence a* e A({a,*, . . ., &)*}). Therefore a*" e A({a,*",..., a, *" у). Since A is 
a multiple of height («,), we have «,*" e A(X). Thus we have proved: if « e 5 
then a*^ є A(X). 
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For0 <r < h we consider 
T, = {a* :aeS, height (а) = r (mod Л)}. 


By the induction hypothesis T, has a finite subset R, with T, © A(R,). We select 
a finite subset Q, of S such that R, = (a* : «e Q,). Let q be a positive integer 
such that height (о) < q for all хє R, and all r (0 S r < А). 

Let у be any element of S whose height m exceeds q; let r satisfy 


Oxr«lh, m-— rz: 0 (mod h). 
For every хє Q, we have a*" e A(X), and since 
m >q È height (а), m = height (о) (mod Л), 
we conclude «*" € A(X U Q,). Furthermore y* e T, = A(R,), whence 
y = y*"c A((a*": xe Q,) = АХО O,). 


Consequently: if В є S, height (В) > q, then B e A(X U Qo U... U Q,.,). 
If 1x i <q we consider the sets 


W, = {a:aeS, height (0) =i}, V, = {a*:aeW,}. 


By the induction hypothesis V, has a finite subset U, with V, © A(U,). Let P, 
be the set of all shapes with height i and cross-section in V;. Then W, © A(P,). 
Our final conclusion is that 


ae A(X Y RY... U RUPU... U V) 


for every « e S. This proves (iii). 


The theorem has the following interesting corollary. Let S be any set of 
shapes. A box with its shape in /'(S) is called cleavable if some hyperplane cuts 
it into two non-empty boxes with shapes in T(S). Thus a non-cleavable box is а 
box which can be tiled with boxes having their shape listed in S, but never with 
a “fault” hyperplane. The corollary is that the set of shapes of non-cleavable 
boxes is either empty or finite. Note that every shape in Вг is non-cleavable, and 
that every non-cleavable shape is in B,. 
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1. Introduction 


For a long time Sir Edward Appleton and Balth. van der Pol were leaders 
in radio science. Appleton was many years president of U.R.S.L, the Inter- 
national Union for Scientific Radio, and Van der Pol was a vice-president. 
Appleton was chairman of the Commission on the Ionosphere, and Van der 
Pol chairman of the Commission on Radio Waves and Circuits. Both men were 
made honorary president of this Union. 

Appleton, born September 6, 1892 and Van der Pol, born January 27, 1889, 
became friends at Cambridge in 1919. Van der Pol had passed his doctoral 
examination (the examination that gives the right to write a doctor's thesis in 
the Netherlands) in Utrecht in 1916. His interest in radio brought him to the 
laboratory of professor Ambrose J. Fleming at University College London in 
1917, and in the years 1918 and 1919 to the Cavendish Laboratory of Cam- 
bridge, where Sir J. J. Thomson was the Director. 

Appleton had made his studies in Cambridge from 1911 to 1914. He joined 
the Army in 1914, was an Officer-Instructor in the Signal Corps and returned 
to Cambridge in the spring of 1919. After a year as a research student he was 
made an assistant-demonstrator of physics, and a Fellow of St. Johns. 

Van der Pol returned to the Netherlands in 1919, and worked for three years 
at the laboratory of the Teyler Foundation in Haarlem under professor H. A. 
Lorentz. 

At the time there were not many people working on the scientific aspects of 
radio, neither in the Netherlands nor in Cambridge, and Van der Pol, who was 
writing a doctor's thesis based on his Cambridge work, proposed to Appleton 
to join forces and exchange ideas with the aim of coming to joint papers. This 
led to many years of scientific correspondence, most letters being devoted for 
90% to calculations and experiments, and to scientific discussion, and only for 
a small part to family matters. In the first years, up to 1924, the correspondence 
was very frequent. Later the interest of both men went into somewhat different 


*) Formerly with Philips Research Laboratories, Eindhoven, The Netherlands. 
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directions, Appleton concentrating on his ionosphere work and Van der Pol 
on mathematical aspects of radio science, and the exchange of letters di- 
minished. 

A large number of these letters were preserved, and copies were given to the 
Philips Research Laboratories by Mrs. le Corbeiller (the widow of professor 
Balth. van der Pol). Although the jointly written book to which many letters 
refer, never came into existence and only two joint papers were produced, the 
correspondence throws some interesting light on the early history of mathemat- 
ical radio science. The development of Van der Pol’s ideas on what later was 
to be known as the “Van der Pol equation” can be followed. Professor H. A. 
Lorentz, well known as a theoretical physicist, but also the government’s main 
adviser on the reclamation of the Zuiderzee, was already 66 when Van der Pol 
came to work with him, but the new subject of non-linearities in radio valves 
interested him. Van der Pol had a great admiration for Lorentz and appreciated 
his help greatly. 

After three years with Teyler’s Foundation Van der Pol went to the Research 
Department of N.V. Philips’ Gloeilampenfabrieken, and the question why a 
scientist with a strong theoretical interest chose to go to an industrial research 
laboratory is also touched upon in the correspondence. 


2. Н.А. Lorentz and the work on non-linear effects in valves 


The first letter in which reference is made to Lorentz is Van der Pol’s letter 
of May 4th, 1920, of which we also quote another part because it shows Van 
der Pol's interest in preparing a good lecture with demonstrations. He was well 
known for that throughout his career and people who heard him on such 
occasion, would not forget it. 


T 


Fig. 1. Illustration of a retroaction experiment (after the drawing in Van der Pol’s letter of 
May 4th, 1920). 


“I gave a lecture to a society of science teachers and showed them the fol- 
lowing retroaction demonstration. A is an ordinary electric buzzer or bell work. 
M consists of two coils, say 50 turns each of diameter 3 and 4 centimeters, with 
iron core. When key B is closed and the two coils M are slipped over each other 
the thing begins to work and can be heard by the audience. Obviously it was 
intended to demonstrate the action of valves and there is a close relationship 
between the two. Amongst other things I showed the effect of variation of the 
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dielectric constant by pumping the air out between the plates of an air con- 
denser. In conclusion I made the audience hear the Eiffel tower signals, which 
is always an interesting experiment. Of course large response in the papers. 
Professor H. A. Lorentz was amongst audience! Also Professor Ehrenfest.” 

“It became necessary to start a scientific wireless society here, which I am 
founding with two engineers. We have the support of a few eminent physics 
professors, so I hope it will run smoothly.” 

“I think, I have told you already that professor Lorentz is now very much 
interested in triodes. He gave a few lectures on this fascinating subject, and did 
some important original work. It is about the solution of the oscillatory equa- 
tions but not with a linear characteristic but a curved one: 


i = aV, + BV + yV. 


The amplitude to which the oscillations build up, which is indefinite in the 
linear problem, can thus be found! Also amplitude of the higher harmonics. 
The third term yV,’ (V, is Eccles’ “lumped voltage") is the dominating term 
determining the amplitude. If you are interested, I may possibly send you the 
analysis.” 


The lumped voltage to which Van der Pol refers here, is V, +- g V,, where 
V, is the potential difference between plate and filament, V, is the potential 
difference between grid and filament, and g is the voltage ratio of the tube. 
Van der Pol worked further on this idea and published two papers on “A theory 
of the amplitude of free and forced triode vibrations” *:?). Later Van der Pol 
extended the power series to five terms and on July 14th, 1921 he writes to 
Appleton on the starting and stopping of oscillations: 

* As this starting and stopping business is only dependent on beat notes the 
characteristic that is only of importance is a symmetrical one that can be ob- 
tained by folding along AB and taking + the algebraic difference of the ordi- 
nates. For 


ip = —aV + BV? + yV? + 6V* + eV? 
ip = aV + BV? —yV? + óV* — eV? 
i (ip — ig) == —aV + yy? + eV^. 


In this way we can explain with a new method of approximation of the 
non-linear equation: 
a) the suppression of an existing vibration by another forced vibration. 
b) the increase of an existing vibration by another forced vibration. 
с) quiescent aerial telephony. 
d) Vincent. 
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e) your l.f. amplification with a h.f. oscillating triode. 
f) difference starting and stopping. 
g) "Ziehen". 

I got Lorentz interested in this matter and he spent several whole days working 
at it, finally confirming my Ziehen treatment. He has now the analysis of a. 
I am working at b, c, d, e and have finished nearly, a, f and g. You see there 
is a world in the y’s and e’s terms." 


Vincent ?) studied the effect on a third resonant circuit of two slightly coupled 
oscillators, when the frequency of one of them was varied and passed the fre- 
quency of the other one. "Ziehen" refers to the interaction between free and 
forced oscillations. 

After a visit of the Appleton's to the Van der Pol's in Haarlem in the summer 
of 1921, the letters are on a first name basis *My dear Vic" and *My dear 
Balth”. 

The two joint papers by Appleton and Van der Pol ^5) appear in the Philo- 
sophical Magazine in August 1921 (communicated by Sir J. J. Thomson) and 
January 1922 (communicated by H. A. Lorentz). 

Yet progress was not always as fast as the two ambitious young men had 
hoped. On December Ist, 1921, Van der Pol writes: - 

“Please excuse my waiting so long before answering your kind letters. I have 
been working very hard at Ziehen with very low coupling and have spent two 
weeks in trying to solve a fourth power ordinary equation exactly, without 
success. That is the reason why I waited so long. I can’t stop looking at the 
equation and am now bored.” 

However, nine days later, he is optimistic again (Dec. 10th, 1921): 

“Today I started a special writing book for putting in all the material of 
the nonlinear theory about which I am absolutely sure: I can use it as a refer- 
ence and it is all nearly ready for writing it down in its final form.” 

And on December 26th, 1921: 

“Your letter is full of interest to me, as I have just recently got out theoreti- 
cally all your experimental results.” 

A letter of June 12th, 1922 shows Lorentz's interest again: 

"I sincerely hope that Lorentz who promised me to go through several parts 
of the later work on suppression and multivibration early next week, will clear 
up some remaining difficulties. For he was very busy the last few weeks and 
could therefore not give me any assistance, whatsoever." 

"Talking to Lorentz while walking home one morning resulted in the fol- 
lowing manner of looking at the old question secondary emission contra snatch- 
ing. Assume that the filament is saturated, that therefore there is no space 
charge, one can put the question as follows. Given the potentials of the anode 
and grid we can place an electron somewhere in the triode and not give it any 
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initial velocity and ask where this electron. will go to. Three possibilities arise: 
it may go to the anode, it may go to the grid, or it may go to nowhere, by 
which is meant that it will like a planet revolve in a closed orbit... . 

Though of course the main question is not solved herewith, this way of 
looking at the problem undoubtedly clarifies it markedly. It bears the mark 
H.A.” 


Appleton to Van der Pol, May 26th, 1923: 

“Well, Lorentz visit was a real triumph for Holland. I am very enthusiastic 
about it. The lecture was on the Electromagnetic Theory of Maxwell and was 
very simple and elegant english. He made one or two very excellent jokes and 
the undergraduates roared with applause. What appealed to them was his 
extraordinary modesty. He never said “I did” once, and was most generous 
in his appreciation of other peoples work. All the big people were there. 
J.J., Rutherford, Larmor, C.T.R., Baker, Newall, etc., Eddington. J.J. in a 
wonderful vote of thanks referred to the lecture as “a generous tribute to one 
of the world’s greatest physicists by one who is his peer” (i.e. equal).” 

J.J. is Sir J. J. Thomson, C.T.R. is Wilson. 


Van der Pol to Appleton, September 10th, 1922: 
“A few days before I left Teyler Lorentz worked at our general equation 
and found a very nice result. Let in general 


9 + a(v) v + B(v) v = 0 


where a(v) and fv) are functions of v. 

He worked with the theory of variations and found the following thesis: 

Let at the moment 1,, v have a certain value vo and v have a value v. Let 
at the moment t5, v have the same value again оо and ù have a value v2. We 
therefore compare the velocities 0; and 9, when the system goes through the 
same position. Now we may ask: if we give the system a small impulse (con- 
sider a varied motion) so that it does not go first through the position vo with 
the velocity v, but with the velocity 0; + óv,, then we may ask what will be 
the change of ù, (09) when at another time (not necessarily the next time) it 
reaches again the position vo. The relation is in general given by 


5(6,)? = (0, exp — | a(v) dt 


1 


and is curiously enough wholly independent of B(v)!” 
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Van der Pol to Appleton, July 19th, 1923: 

"Yesterday I went to H.A. to congratulate him with his seventieth birthday. 
It was grand to see "the old man of science" in the midst of a lot of flowers 
with telegrams pouring in from all over the world, in the meantime playing with 
his grandchildren and laughing and joking like a boy of twenty. He was full 
of his visit to Cambridge and was very pleased to have seen you." 

Van der Pol had another opportunity to show his appreciation and respect 
to Lorentz on the occasion of the fiftieth anniversary of the latter's doctor's 
degree $). 


Finally there is the nostalgic note in Appleton's letter on the death of 
Lorentz, February 14th, 1928: 

“Му dear Balth 

I very much valued your kind letter dealing with Lorentz’ last illness. I felt 
the blow very much for 1 remember with gratitude his (and yours too) kindness 
in allowing me to work at Haarlem. These were great days, were they not? 
We in England feel that we have lost a very dear friend as well as the greatest 
physicist of his time. Time has taken us both into other fields of investigation, 
but we must always remember the starting points under J.J. at the Cavendish 
and under Lorentz at Haarlem.” 


3. The offer of Philips and its acceptance; first successes 


Van der Pol had played with the thought of a professorship at Delft Technical 
University and given a lecture there, which was well received. However, it seems 
that it was a time for economy and a professorship was not yet in sight. 

The director of the Philips Research Laboratories, dr. Holst, came to Haarlem 
and offered a position. As we shall see, Van der Pol did not decide at once, 
though his friend Appleton immediately saw the great opportunity. We will 
follow the correspondence through to the first successes, the high-power triodes, 
which were rather unique in the world at the time and led to very good results 
in communications, e.g. with the Netherlands East Indies, via the own station 
РСЈЈ. 


Van der Pol to Appleton, April 26th, 1922: 

“Some time ago, I think, I wrote you about the chairs for technical physics 
in Delft that were going to be founded. Since then in toto five professors died 
and they are so extremely careful now in Delft, that of these five places three 
will remain unoccupied. Under such circumstances it is clear that there is little 
prospect that I get a professorship there within say the first four years, more- 
over so as I hear that the lectures in technical physics will be given by the 
present professors. 

Now some time ago Philips offered me a job, and last night Holst (you know 
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him) was here to discuss things more fully. They want me to develop experimen- 
tally an idea which some preliminary experiments proved to be practical. I can 
not write details but very confidentially I can state the main subject: It is about 
triodes without filaments. 

The job is attractive from different points of view. First: there are now in 
Philips's laboratory many very first class people as: Hertz (of the Frank and 
Hertz experiments), Holst, who discovered the supra conductors and soon they 
will have Professor Gerlach of Frankfurt who found the magnetic moment of 
uncharged silver atoms by splitting up a beam of free moving silver atoms in a 
Knudsen vacuum by a magnetic field. 

Further quite new laboratories are being built in the American G.E.C. fashion 
and my job would be wholly independent of the works. It concerns therefore 
purely scientific work. Moreover as Holst told me, they are working very freely. 
E.g. for several reasons they want to know a great deal more about gas dis- 
charges than is generally to be found in the textbooks. Hertz is the man for that. 
He simply spends all his time for finding out what really happens in a glow 
discharge and it appears that J.J. and Townsend are altogether wrong in many 
respects. Working in this general scientific way already pays. They have the 
example of Langmuir. 

Pensions belong to the job as well as cheap living in a Philips house. And 
though this is not definitely settled yet they offered me practically £ 1000 to 
start with. 

The drawbacks are: seven working hours a day; one month holiday a year 
and of course in such a job one is not as independent as in a university post. 

The reason for my writing all these details to you are twofold. First I know 
that you are interested in our doings but secondly I wanted to have your advice 
in the following matter: Do you think that there is any prospect for me for 
getting some Cambridge university job within a year or so with any degree of 
certainty and, if so, could probably anything of the kind be discussed either 
with J.J. or Rutherford when I come to Cambridge. As you know the first 
thing is not a big income but sufficient to live, say some £ 700 or £ 800. But I 
want to have a fair measure of time for research. For though we are very glad 
with the job offered me here, we should still prefer to live in Cambridge. Апа 
there is a perhaps not so very well-known rule when playing chess: when you 
see a good move, don't do it; first look out whether there is not a better one. 

All this would of course have its influence on the book. If I accepted the 
Philips job, this would mean that most probably I did not have the opportunity 
for doing much experimental work directly for the book, though of course I 
would do everything possible on the theoretical side in my evenings. On the 
other hand, if I could get a Cambridge job, it would mean that all my research 
time would be spent on the book. 

Of course I will not decide about the job before Professor Lorentz is back. 
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He is expected about May 10th. And if there is any reasonable chance for a 
Cambridge job I will not decide before I have been in Cambridge. It would 
therefore be most agreeable to me to come to you about the end of May. Does 
this time suit you and Jessy? 

Summa summarum: I should be very pleased to get the following data: 
First your preliminary advice about the Cambridge job, and perhaps later on 
you could let me know more definite things when you have spoken with J.J. 
or Rutherford." 


Appleton to Van der Pol, May 2nd, 1922: 

“So much of this letter was written when I received your extremely welcome 
letter containing the news. I communicated the gist to Jessie and we both wish 
to offer our hearty congratulations to you both because we both feel that it is 
a good opportunity since it will lead to bigger things. We had, as I expect you 
both had, a shade of disappointment when we heard that the chairs at Delft 
were not to be filled. But we do think the other offer is to some extent a very 
welcome recompense. I have not yet had time to see Rutherford about the 
question of your coming here but I am going to put before you the facts as 
I find them. In the first place one could only make up £ 700 or £ 800 a year 
with a Fellowship (£ 200). There is no lectureship higher than about £ 250 per 
year. And of course Fellowships are reserved for members of colleges as you 
know. At present there is a tremendous drop in prices and a wholesale cutting 
of wages and salaries. Jobs are hard to get and people who have them are 
sticking for all they are worth to keep them. Thus I feel that if you came and 
got say a lecturing post £ 300 per year with only about five hours work per 
week the rest would have to be made by coaching which of course would destroy 
all the originality for research which you possess. Coaching is at the rate of 
£ 12 per term for three hours per week. There are three terms per year so that 
a lot of time must be given up to coaching to make an income of it. 

Moreover the teaching of the place is always done by men of Cambridge 
Colleges who have been through the Triposes. If you came to Cambridge as 
an advanced student and joined a college for two or three years I don't think 
it would be long before you got a Fellowship like Chadwick has done recently 
(He came here three years ago at Canon College) but of course there is always 
a risk. But I truly and Jessie too think that anything even approaching the 
present offer is quite out of the question. When my Fellowship lapses as it will 
after the six years is up I shall be reduced to about £ 650 a year and if I wanted 
to do more research than I do now I should have to drop some teaching and 
get even less. 

However you know that we both should dearly love to see you here and if 
there is the slightest possible chance I will do all I can. Thus I will sound 
Rutherford at the earliest opportunity." 
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Van der Pol to Appleton, May 4th, 1922: 
“Many thanks for all the information about Cambridge. I wonder what 
Rutherford will say about it.” 


Appleton to van der Pol, May 17th, 1922: 

“And now as to the important matter re Rutherford. I have talked to him 
and he told me that rather than increasing his staff he must reduce it and so 
he is wanting — and — to get jobs outside. This of course is very confidential. 
The number of students is now going down since the heavy mob of students 
after the war is over and he finds himself overstaffed for teachers. There is 
however another line. You think J.J. would take you on as his private research 
assistant. Aston once held this post. I don’t know how much he got though.” 


Van der Pol to Appleton, May 20th, 1922: 

“Many thanks also for talking to Rutherford. I think that after all you told 
me we will decide to go to Philips. Hence it would be best not to talk any- 
more on any Cambridge job.” 


Van der Pol to Appleton, June 12th 1922: 

“In the first place we have decided to go to Philips. We carefully weighted 
the pro’s and contra’s and found a positive rest for the pro’s. There are however 
two important contra’s left: the first one is giving up the personal relationship 
with H.A. and the second one is closely related with the book. For I fear that 
in the next future I shall not be as free as I was here in Haarlem to spend much 
time at the problems of the book.” 


Van der Pol to Appleton, September 10th, 1922: 

“From the 17th-24th there is a big scientific German conference at Leipzig. 
I'm going there for Philips to attend the meetings (with Holst, Oosterhuis and 
Hertz).” 

“Finally a few words about the new job. Well Iam very busy indeed, which 
means: I am at the lab: 84-12, 2-6, so that not much time is left for working 
at home. However my job is very independent and I choose my own problems 
while keeping an eye of course on technical applications. It is a really nice 
thing to be able to order in the morning to have 10 triodes made of special 
design, do nothing, and have them brought in in the afternoon by my assistant!! 
No glass blowing by myself is necessary, I have everything done for me. The 
only thing I have to do in my work will be to read instruments. I wonder whether 
J shall ever have dirty hands again in my life! I think, when my special room 
is ready, I will tackle secondary emission profoundly, and let you know results.” 
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Van der Pol to Appleton, December 21st, 1922: 

“I was in London with Holst more or less as a “commercial traveller” in 
25 Kilowatt triodes! We visited the Marconi Co, dear old Fleming and Eccles. 
They were developed here, as I told you, and now it happened that quite inde- 
pendently Langmuir did similar experiments. Of course the whole thing is an 
application of a new joint between glass and metal. The external appearance 
of our triode is similar to Langmuir’s, but the material used is much better 
(i.e. our material is). The anode is outside and is watercooled. You ought to 
have seen Fleming’s face as he was standing with this thing in his hands. Of 
course he started talking of the Albert medal, etc. etc. He said to Holst: it is 
here that the baby was born.” 


Appleton to Van der Pol, January 9th, 1923: 
“I hear from Turner that Eccles was very much impressed by the high power 
valves. He thinks it is one of the biggest things yet done in wireless.” 


Van der Pol to Appleton, February 8th, 1923: 

“In the meantime I devised a 50 Kilowatt transmitter which approaches com- 
pletion. One big metal triode! Recently I showed to general Ferrié, who was 
staying in Holland, a 2 Kilowatt transmitter with an outside anode not bigger 
than ordinary R one, which whole triode one can easily put in one’s vest pocket, 
and which can transmit... two Kilowatts! Eh?” 


In the meantime the work on secondary emission and on nonlinear theory 
went on, but the first clear success were the big triodes: 

Van der Pol to Appleton, July 19th, 1923: 

“I dream of the equation day and night and spend all my spare time on it. 
At the lab nice progress is being made with the big triodes and within one or 
two weeks I will come round to Carnarvon again to try twelve in parallel yielding 
some 200 Kilowatt, the biggest amount of energy so far produced with triodes 
in the world. This afternoon I had a hundred ampéres H.F. current in the set 
. you saw in my room. The screws in the wood were red hot and the wood was 
burning! All this takes a lot of time and thinking.” 


Van der Pol to Appleton, August 19th, 1923: 
“In Carnarvon I got 200 Kilowatts in 8 triodes at a high efficiency, but this 
is still a secret”. 


Appleton also ventured some commercial tips; September 14th, 1922: | 
“Do you think that it would be a good plan for your firm to take up the 
manufacture of a sealed off cathode ray oscillograph. I certainly do.” 
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Meanwhile “the book” did not make much progress, and though there is 
no doubt that Appleton and Van der Pol intended to go to the bottom of all 
problems, other workers in the field managed to get important results. 


Appleton to Van der Pol, May 31st, 1923: : 
“Have you seen Gill’s article (Phil. Mag. for May)? Не has taken the cream 
of our milk I am afraid. We must hurry on with our publication.” 


Van der Pol to Appleton, June 5th, 1923: 
“Yes we must hurry. We can't also omit referring to Tank. But we could 
state that we found our result independently." 


Soon Appleton was going to get important results in research on atmospherics 
and in the work on the ionosphere that later was going to win him the Noble 
prize. Van der Pol would indeed solve the problems of the non-linear theory 
and find the theory of relaxation oscillations (e.g. the heart beat). The mutual 
friendship remained. 

On another occasion we may show from the letters the influence of the 
Cambridge contacts on the development of the Van der Pol equation. 

The letters throw an interesting light on two of the greatest men of radio 
science in their early years. In the last letters, August and December 1954, 
Appleton is interested in the CCIR documents on the ionosphere (Van der Pol 
was Director of CCIR at the time) and Van der Pol is interested in Appleton's 
work with respect to the F, layer, and the tone of the letters is as friendly as 
ever. 
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Abstract 


A general linear field supports both a discrete and continuous spectrum 
of modes or source-free field solutions. Although only eigenmodes are 
necessary for a complete representation of fields excited by sources, 
noneigen discrete modes (quasimodes) provide a useful and rapidly 
convergent alternative to the continuum eigenmode part of a field 
representation. Quasimode field representations, and their interpreta- 
tion in terms of quasiparticles, are discussed both generally and for the 
special case of a vector electron plasma field, for which the complete 
set of eigenmodes is found via a resolvent or characteristic Green's func- 
tion method. 


1. Introduction 


As is known, many wave types or modes are capable of excitation by sources 
in general linear systems. Some are discrete or continuous eigenmodes that, 
taken together, provide complete representations of excited fields. Other types 
may be noneigenmodes, not members of a complete orthogonal set, but they 
nevertheless play a useful role in field representations. If the system admits a 
continuous eigenspectrum, both eigen and noneigen modes are usually present 
and one must determine which of the possible discrete modes are eigen and 
which are not. Discrete modes may be chosen to be either oscillatory, guided 
wave, etc., each being distinguished by a specific wave structure or polarization. 
In the following we shall restrict the discussion to oscillatory modes with 
prescribed spatial periodicity. Although the analysis will be applicable to general 
linear systems, a specific composite (electromagnetic-charged particle) system 
will be treated in some detail to illustrate how one distinguishes between eigen 
and noneigen discrete modes, how one determines complete sets of eigenmodes 
by Green's function techniques, and how one utilizes the noneigen modes. 
Since discrete modes are source-free wave solutions, we shall first review the 
general features of their determination. 


*) It is with great pleasure that I offer this contribution as part of a well deserved testimonial 
issue for Chris Bouwkamp. These comments on modes recall for me Chris' elegant use 
of eigenmode techniques in his classic treatment of diffraction by a circular aperture. 
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Source-free, homogeneous, and stationary linear systems сап be generally 
described in terms of homogeneous first-order field equations of the form 


(> К is) pr, t) = 0 (1) 
i ot 


where L is usually an п Xn matrix operator and y an n-component wavevector. 
Such equations admit plane-wave solutions of the form exp [i(k.r— t)] 
where the spatial and temporal periodicities, К and o, are related by the de- 
terminental equation 


det L(k, w) = 0. (2) 
Oscillatory solutions of (1) have the form 
yr, t) = Vk) exp [i(k . r — œk) ¢)] (3) 


where for k given w,(k) defines the ath root of the determinental equation (2) 
and distinguishes the dispersion relation for the ath oscillatory mode; (К) 
is determined from eq. (1) on use of (2) and (3). As is generally known, the 
oscillatory modes (3) give rise to a complete set of orthogonal eigenmodes in 
the polarization (spin) space spanned by the eigenvectors Y,(k), а = 1, 2, 
s, п, for fixed К. 

The relevant eigenvalue problem is readily obtained by decomposition of 
the matrix operator L into spatial and temporal components, viz. *) 


V Wo 
L = M| — ——, (4a) 
i i òt 
where M and W are nxn matrices. Thus for oscillatory solutions (3), eq. (1) 
assumes the form of an eigenvalue problem: 
M Pa = о, W Ya (4b) 
where M, W,, о, are k-dependent; the adjoint eigenvalue problem is 
M+ Pt = о, Wt Yt (4с) 
where M+, W+ are transposed conjugate (Hermitean adjoint) matrices and 


V. *, w,* are the adjoint eigenvectors and eigenvalues. From eqs (4) one con- 
ventionally obtains the biorthogonality property of the eigenvectors as 


(Pat W WV) = N, P (5a) 


where ( , ) defines the Hermitean inner product in the space spanned by the 
vectors V,, N, is a k-dependent normalization constant, and д, р is the Kro- 
necker delta which is unity if о, = ор and zero otherwise. It is desirable to 
rephrase both the orthogonality property (5a) and the completeness of the 
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VP, in terms of a “completeness relation”, which provides а representation of 


the identity operator as 
i W Pa Pat (5b) 
= у F Е 


from which опе can infer (Sa). In general the “summation” index spans а 
discrete х and a continuous @' spectrum of eigenvalues w,; in the case of 
continuous indices a’, 6’ the Kronecker delta in (5а) is to be replaced by a 
delta function ó(a' — A’). 

For n finite and k given, the zeros w, of the determinental equation define 
the л discrete eigenvalues of the operator M (provided M is a “complete” or 
normal operator). For infinite л, the picture may be quite different. In this 
latter case, some of the zeros of eq. (2) distinguish discrete eigenvalues of М 
but others do not. Since the zeros of the determinental equation yield dis- 
persion relations for the discrete oscillatory modes which the system can sup- 
port, the question arises as to how to distinguish the zeros characteristic of 
eigenmodes and those characteristic of noneigenmodes. This problem arises in 
a number of different fields; its importance stems from the need to ascertain 
those zeros of (2) which correspond to source-free solutions that are members 
of a complete eigenset and those that correspond to noneigen solutions and 
hence are not to be included in the complete set. Depending on the field, the 
noneigen solutions are termed leaky-wave, complex-resonance, radioactive-state, 
Landau, etc., solutions. In the following we shall term them quasimode solu- 
tions. They arise only when the operator M possesses a continuous spectrum 
of eigenvalues, or equivalently if the operator L(k, w) is a nonanalytic func- 
tion of о. 

Quasimode solutions generally correspond to complex roots of the deter- 
minental equation (2). When M — M* is an Hermitean operator and hence its 
eigenvalues w, = w,* are real, the presence of complex roots clearly under- 
scores the noneigen nature of the quasimode solutions. Despite their lack of 
membership in a complete eigenset, quasimodes frequently play an important 
role in field representations. The solution of an initial-value problem for the 
field defined by eq. (1), or of a source-excited inhomogeneous version of eq. (1), 
can be represented as a superposition of eigenmode contributions via a well 
known procedure. Such field representations in general comprise both discrete 
and continuous eigenmode contributions. Quasimodes provide an alternative, 
and usually rapidly convergent representation of the continuous eigenmode 
contribution. In fact in problems wherein only a continuous eigenspectrum 
exists, one or two quasimode types may provide an adequate representation 
of the field response to arbitrary excitation; this feature represents one of the 
important applications of quasimodes. 
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In sec. 2 we shall introduce the characteristic Green’s function, or resolvent 
operator, for the linear field described by eq. (1). This operator, whose singular- 
ities are determined by eq. (2), permits a ready distinction between the eigen 
and noneigen roots of eq. (2); also alternative representations of such Green’s 
functions clarify the utility of quasimodes in the solution of field problems 
when a continuous eigenvalue spectrum exists. In sec. 3 we consider, as a 
special example, the linearized electron plasma field and summarize the charac- 
teristic Green’s function technique for determination of the complete set of 
eigenmodes and as well the quasimodes of this field for given k. In sec. 4 
quasiparticle concepts are introduced to illustrate how k-dependent quasi- 
mode wave packets evolve in space and time. 


2. Characteristic Green’s function operator 


The deeper significance of the determinental equation (2) becomes evident 
on introduction of an appropriate source-excited or Green’s “function” 
problem. For fields of spatial form exp (ik . r) one can define a unique time 
dependent Green's function (matrix operator) С(/, t") for eq. (1) by 


© W o 
ix ix) Gt, = [м ЕШ z] GU) = да—) (6a) 
ot i ot 


and the requirement that 
G(t, t) 20 for ї<1'. (6b) 
The Laplace representation 


cot ia 


баг) = = | G(o) exp [i o (t— t^)] do, (7) 


= о+іа 


with а chosen to be a suitably large positive number so as to satisfy (6b), 
introduces the spectral operator G(w), which on transformation of eq. (6) evi- 
dently satisfies the operator equation 


ЦК, w) G(w) = [M — w W] Glo) = 1. (8) 


The matrix operator G(w) = 1/L(k, w), the so-called characteristic Green’s 
function or resolvent operator, is a singular function of complex w. It mani- 
festly has pole singularities at the zeros of det L(k, о) and, if L(k, w) is non- 
analytic in w, branch line singularities. For kinetically described many-particle 
systems, such as the electron-plasma example treated in sec. 3, G(w) possesses 
a branch-line singularity on the real w-axis. In consequence, a two-sheeted 
Riemann surface, with the two sheets connected via a branch line cut along 
the real w-axis, must be introduced to depict the dependence of G(w) on com- 
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plex w. One sheet of the Riemann surface, the so-called “physical branch”, is 
so defined that it contains only the eigenvalue type of pole singularities; the 
other “nonphysical branch” will contain the noneigen singularities. 

The singularities of G(w) depict, among others, the natural resonances or 
eigenfrequencies of a system, and thus there should be a completeness rela- 
tion associated with G(w). This well known completeness relation can be simply 
inferred by division of eq. (8) by 2zic and counter-clockwise contour 
integration of the result over an infinitely large contour C centered at w = 0, 
whence there is obtained (cf. ref. 2) 


1 = W G(w) а 9 
ED (а) do, ©) 


where the physical branch, on which the contour C is taken, is defined by the 
requirement that 


$ 29 ai (10) 
w 


с 
The knowledge of all singularities of G(w) on this physical branch leads, on 
evaluation of the residue and branch-line contributions from the contour integral 
in (9), to a result of the form (5b) — i.e. to the identity operator which reveals 
the eigenvectors, and their normalizations, for both the discrete and continuous 
spectrum. 

The above distinction between the physical and nonphysical branches of the 
Riemann surface for G(w) = 1/L(k, w) provides the basis for distinguishing 
between roots of det L(k, w) that correspond to eigenmodes and to quasi- 
modes. The poles of G(w), or roots of det L(k, w), that lie on the physical 
branch distinguish the eigenmodes, whereas the poles, or roots, on the non- 
physical branch define the quasimodes. Figure 1 is illustrative of a singularity 
picture for a typical component of the matrix operator G(w). The presence 
of a single pole (root) at w = 0 indicates the system in question supports only 


«—plane w-plane 


(a) physical sheet (b) nofphysical sheet 


Fig. 1. Singularities of G(w); circles: poles, heavy lines: branch lines. 
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a single eigenmode, whereas the multiple complex poles (roots) on the non- 
physical branch are indicative of multiple quasimodes. 
The preceding observations imply the identity 


1 W G(w) do = (11) 


л 
с 


aa’ 


which follows from the explicit evaluation of the contour integral in (9). For 
the example of interest (cf. sec. 3 and fig. 1), G(w) has both pole and branch- 
line singularities on the physical sheet. The residue contributions to the integral 
in (11) from the poles of G(w) will be shown to yield discrete («) terms in the 
eigen representation in (11) while the branch-line contribution will yield the 
continuous spectrum (o^) terms. 

It is of interest to compare the implications of the — EN in (7) and 
in (11). From (7) one infers, since —i W G(t'+, t^) = 1 is implied by eqs (6), 
that 

1 о+ ia 
I | W G(o) dow (12) 


— oia 


where an exponential convergence factor exp (—ic4) (with 4 = 0+) is im- 
plied for Im о < 0. On the other hand, on contour deformation eq. (11) 
implies 
ES cot ia o -— іа 
pc -f W G(o) dw — — -f W G(o) do (13) 


Tb apti —o-ia 


where the real positive number a is chosen sufficiently large so that deformation 
of the contour Cin (11) to that given in (13) is such as to retain all G(w) singular- 
ities within the segment of the w-plane bounded by the lines о = +ia and 
œw = —ia. If the integrals in (13), taken over the indicated upper and lower 
contours on the physical sheet, are deformed into the upper and lower shores 
of the branch line (cut) along the real w-axis, one obtains, on taking cognizance 
of the G(w) poles in the physical sheet, a residue sum and a branch-line integral 
that reproduce the completeness relation in (11). On the other hand if in the 
integral representation (12) the contour along Im o — -Fa in the physical 
sheet is deformed through the cut along the real axis onto the lower half of 
the unphysical sheet and into a contour approaching w — —ioo, then because 
of the implied convergence factor in (12) the integral over this deformed con- 
tour vanishes. The residue contributions at the G(w) poles encountered in the 
deformation arise from poles «4. which lie in the upper half of the physical 
sheet and poles ор in the lower half of the unphysical sheet; this residue sum 
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provides an alternative representation of (12) that may be written in the form 


W V, Yt W V, W,* 
1 = у == ) bn iri M (14) 
N, № 


а+ В 


In (14) the discrete quasimode contributions at о == w,(k) are phrased in the 
same form as the discrete eigenmode contributions at о = w,,(k). Complex 
poles at w,, are indicative of unstable modes (if they exist) whereas the com- 
plex poles ор imply damped modes; furthermore, it should be observed that 
the W, possess orthogonality properties whereas the V, do not (in the usual 
sense). 

The identity operator representations in (11) and (14) appear in clearer per- 
spective if alternatively phrased as Green's function (operator) representations. 
Thus, paralleling the familiar Green's function representation in (7), one has 
instead of (11) for t >t’ the eigen representation 


ШАР 
бю =! у x exp [~i w, (t— t')] + 


a 
а 


Pa Pat 
+i m Р [—i o, ((— t)). (15) 


a’ 


Correspondingly, instead of (14), one has for t > t’ the mixed representation 


V, Pat 
G(t, t') = 23 T exp [—i о, (t— 1)] + 


а 


+ 
Pr Pet 
+i › exp [~i o (t— 1)]. (16) 
№ 
в 


Although, because of the notation, (15) and (16) appear very similar, it should 
be noted that (16) is usually far more convenient to use in applications because 
the В sum is generally rapidly convergent whereas the o' “sum” in (15) really 
is an integral. Multiplication of (15) or (16) by a relatively arbitrary excitation 
vector ¢,(t’) and integration over /' yields a representation of the transformed 
response У, (2) to the source $,(£). Further multiplication by exp (ik . r) and 
integration over all k then leads to a representation of the space-time dependent 
response (r, t?) to the source vector ¢(r, t), as defined by the field equation 


Ly=¢. 
3. Electron plasma field 


As an explicit illustration of the above we shall evaluate the characteristic 
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Green’s function operator and complete set of eigenvectors for a collisionless, 
isotropic, electron plasma field ?). At the linear kinetic level a sourceless non- 
relativistic electron plasma is described by the normalized Maxwell-Vlasov 


equations: 
dE 


——cVxH-— | vfdv = 0, 
т i 
oH 
eV XE + — = 0, (17) 
t 


d 
—Vefo B+ (++. v) = 0, 


where the vector electric field E = E(r, t) is normalized to the critical field 
Е, == m waje, the magnetic field Н = H(r, t) to (€o/f4o)"/? Е,, and the veloc- 
ity integrals of the electron distribution function f = f (v, г, t) and its homo- 
geneous background component f; = fo(v) are both normalized to the back- 
ground plasma density по; е, т are electron charge and mass; £o, ио are the 
permittivity and permeability of vacuum; c = 1/(ио &9)!? the speed of light 
and v the electron velocity variable are both normalized to the electron thermal 
speed а = (2k, Tm) ^? ; wp = (по e?/m &;)!? is the electron plasma frequency; 
time t is normalized to 1/w, and distances r to the Debye length a/w,; all in 
MKS units. 

Equations (17) are manifestly expressible in the matrix form (1) if one defines 


1% V 

-—1 —c—x1 iv E 

i ot i 

V 10 

L—|c—x1 -—1 0 ,v—|H (18a) 

i i ot 

. . ò 

i Vofo 0 —i TERN 1, f 


with the matrix product L y reproducing the left-hand members of eq. (17). 
In conformity with general matrix notation, the elements of the matrix L are 
partitioned into vector or dyadic elements in 3-vector or co-vector space; 
1 — (д) is the unit dyadic in 3-vector space representative of E ог H, 
1, — (ô(v — v’)) is the unit “dyadic” in an oo-vector space representative of 
the velocity variable v, and v and V,fo are oo-vector elements. On trans- 
formation to the К, w basis one finds 


—o1 —kcxl1 iv 
ЦК, o)-—|kcx1 -—ol 0 (18b) 
iV,fo . 0 —(w— k . v) 1, { 
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from which the component operators M and W are readily identified; one notes 
the weighting operator W = 1. The characteristic Green’s function С(о), i.e., 
the inverse of Z(k, w) as defined in (8), is evidently representable by the matrix 


Gi, б, С 
G(w) >| б», G22 Gos (19) 
Сз, G32 Gs3 


whose dyadic, vector, and scalar elements can be inferred from (8). Thus for 
example the first column elements are defined for Im wlk 5 0 by 


kx(kx1)c? 1 Y V, 
-e[14 9727. fo JE (20) 
w? wk? u— wjk 
whence 
kcxi i V,f. 
21 — . Gi, G5, = : . V1j* 
w w— ku 


For isotropic fo(¥) it is convenient to introduce vector decompositions, longi- 
tudinal and transverse to the propagation vector k = k Ко, by 

vy=uk,+ Vv’, 

1, = ko ko = 1— 17, 17 = —ko X(ky х1), 


whence on decomposition of G,, into (diagonal) scalar longitudinal G,, and 
transverse G,, components via 


Gui = с. + Gi, lr, (21) 
one finds from (20) (on integration by parts using fo > 0 as v > oo) 
1 Ofo/du = w 
Gy =- hi- Yol a| , Im — z 0, 
w k? J u— w[k k 
(22) 
6 ak oh, 1 Jo dv | I а о 
= — — | 1— — , Im— P 
ЕЕ w w? wk? u— ofk 


Since the integrands in (22) have a singularity at u = w/k, the integrals in 
(22), and hence G,,, are undefined on the real w-axis for real k. To define 
unambiguously С, ,(w) the w-plane may be viewed as a two-sheeted Riemann 
surface with a branch line on the real w-axis. To distinguish the physical sheet, 
on which G,,(w) vanishes at w — оо, requires the ability to decompose the 
integrals in (22) into parts regular in the upper and lower halves of this sheet 

and vanishing at со. 

With the notation 
у= 


o І 220 т T) dy? — 1, 
"n mu) = i ЛУ) = golu) Рә"); | Fo(v") dv? = 
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Plemelj’s theorem provides “regular decompositions” of y(u) into parts 
ї]ь@) regular, respectively, in the upper/lower half planes of > and vanishing 
at со in their half planes of regularity, viz. 


dw, Imv 20, (23a) 


provided y(u) is integrable from —oo < u < +o. It is to be noted that 
although the decompositions 7,(v) in (23a) are regular for Im» 2 0, respec- 
tively, in the physical sheet thereby defined, they may be “analytically con- 
tinued” into the unphysical sheet where they may have singularities. On 
allowing > in the physical sheet to approach the real axis, one obtains from 
(23a) in the limit Im v = 0: 


ao 


á | ШЕЕ m lin 20, (23b) 


where P denotes the Cauchy principal part. As a special case of (23), one has 
the delta function decompositions 


1 1 
ó4(v — и) = d — : Im» 20 (24a) 
2лі u— у 
Р 1 (и — >) 
= 4 — + Я Im» = 0. (24b) 
2лі u—v 2 


With the regular decompositions (23) one can now unambiguously extend 
the definition of G,, in (22) to all values of v. The desired extension for the 
longitudinal components G,, (a = 1, 2, 3) becomes for all v in the physical 
sheet 


Ko K 
Gu L = —— 
ky et(v) 
б, -—0, (25) 
+ 2x 6.(v— и) 
G = ү) ko, 
31 v e*(») n(Y) ko 


where 
EC) = nlu) Ео(ү?), 
(у) = 1 + 2zi 92 (0). 


In a similar manner one generalizes the transverse G,, components in (22) to 
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all y in the physical sheet, via: 


2 1 
= — ——— 1 
11 2+0) T 


à 1 
= — ———-ck, xl, 26 
а= vag d (26) 
А (и) 27 д.(и— v) 
Сз; = F ==, Уут Fe(v7), 
2+0) 


where 
1 
Au) = k2 golu), 
2 


st(v) = »— = + 2лі (2). 
y 


With the knowledge of G,, one can employ the completeness relation (11) 
to evaluate explicitly the eigenvectors Y, and their adjoints V,*. If we denote 


E, E,* 
v =» H, , pt =» H,* (27) 
Лу) VU 


then from (11) and (25) one finds that the longitudinal E, components are 
given by 


E, E,** _ kok ky k dy 
= PEE x С, 1(@) dw = = оо Po . 
Nz 2лї e? et(v) 


« 
For a "passive" plasma with fo(v) Maxwellian (for example), v e*(v) has a 
zero at > — 0, whence evaluating the residue at > — 0 and the branch-line 
contribution along Іту = 0 within the contour C, one has 


E,E,** КК, ү 1 1 


7 270) O) 


dv 2g 
м «бф — 2ni_J = ii 


where 
1 1 |ui 10) 2лі no) 


c0) 0) |? 20) 0) 
Under os v. 


Similarly from the longitudinal component of G3, in (25) one obtains 
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CES 1 k 2z ô (w — 
у" ENS фе, do = aur F 27 0.(v— u) E 
g с 


N, 2лї 2лі с у &t(y) 


whence on evaluation of the residue and branch-line contribution within the 
contour C 


AES" ОИ “7 b.(v—u) à. (v—uy]dv 
т or лт M 


If for simplicity one adopts the normalization Е„+ = Ко for the longitudinal 
component of E,*, then via eqs (27) and (28) one identifies a discrete longi- 
tudinal oscillatory eigenmode with eigenvalue » — 0, and eigenvector and 
normalization given by 


Ko 
VL kí N, = &(0) (29a) 
a ik &(v) , a 


и 


Similarly the branch-line contribution in (28) yields a continuum of eigen- 
modes with real eigenvalues —co < v < +00, and eigenvectors and normal- 
izations 


ko s 
Lo 0 , Ne= dns (29b) 
—ik e.) Fol") 19 


where 


о сое 


nC) e* (9) e (9) 
= et (u) ô (w — и) + ec (u) 6_@— и). 


It is evident that, for the electron plasma being considered, det L(k, w) 
possesses only one zero on the physical sheet. However, as noted in sec. 2, 
det L(k, w) may have zeros on the unphysical sheet. Examination of e+(v) in 
the lower half plane of the unphysical sheet reveals complex zeros at > = v, 
ie. det L(k, œ) does indeed have complex zeros at w = Кэ. Via analytic 
continuation into the unphysical sheet as sketched in connection with relations 
(12) and (14), one identifies from the expressions for G in (25) and from 
the v = », residue contributions to the integral in (12) the longitudinal quasi- 
mode vector and normalization 
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Ko 
we 0 _ d А 
‘“— ik Ev) Я NIS (v) (80) 
u— 2; s 


with £*(»j) == 0. It should be observed that if the electron plasma were un- 
stable (by appropriate choice of fọ), there would be eigenvectors of the same 
form as the quasimode vector in (30) but >, would lie in the upper half plane 
of the physical sheet. 

The integration procedure employed in eqs (28) and (29) to infer the longi- 
tudinal wavevectors from the longitudinal Green's functions in (25) can be 
repeated for the transverse Green's functions in (26). Thus from (26) and (11) 
one infers from the transverse G 11 the transverse completeness relation: 


E, E,** 1 1; dy 
) =— — $ rdc 
Na 2лі ; 2лі ; КД 
which by contour integration yields the branch line contribution: 
lr qp 1 
= — —— — —— |d. (31a) 
2дй + èo) at) 


Similarly from the б, ı component 


H, E,** C ko X17 dy 
ee ee 
Na — 2ni 2mi У» &t(y) 


a 


€ ko X1y j| E 1 | 
с mi JL ee) $91» (31b) 
From the transverse С, 1 component in (26) one infers 
f, E,** l pfa ó.(u— » 
= $ G5, do = F ik Au) Ver Fo(v?) pay 
2лї ¿ e #+ (2) 


= ik Ao) Vor Fo?) ДЕ = » S à. (310) 


Equations (31) imply a continuous spectrum of eigenvalues — оо < » < +00 
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with eigenvectors and normalizations given by (if one adopts the adjoint nor- 
malization E,* = To): 


To 
: c ko X T $0)? 
ФТ —E 028 , NS = |2001 D (32) 
У 10) 


ik GU) To . Ver Folv") 


where the transverse (to Ко) unit dyadic 17 = Ty’ To’ + To” To”, and in (32) 
То is the unit vector Ty’ or To” which defines the polarization of the two dis- 
tinct types of transverse eigenvectors, and where 


TED ө? 8-(и— 9) 
мы = ш LARAN QU. AS 

10) èto) 2-0) 
For a stable plasma the form (26) of &+(») implies that there are no discrete 
eigenvectors nor significant quasimodes; i.e. no significant zeros of det L(k, о) 
arise from the transverse structure of the field. 


| = @t(u)6,(y—u)-+ 8 (u) ô- (v—u). 


Adjoint wavevectors P,* 


The determination of the adjoint wavevectors У,+, corresponding to Y, 
and with components defined in (27), is based on the matrix Green's function 
elements G3, and G;;. From the inverse of the matrix operator ДК, œw) in 
(18b), one finds that G33 = Сзз(у, v) is defined by 


w a ee ix 1; [ve doi Mr T 
(«-2) 337 Ok ^-(ь eae) VG33 ye ) 


1— К? c?/w? 


whose form implies that G33 can be decomposed into parts even and odd in 
ҮТ, viz. 


G3s(v, v) = Е (ҮТ) — 


„(и, u' (и, и' 
gon u) = = ) Мут Fo(v7) . v'7. (34) 


The even part defines a longitudinal contribution g,(u, и”) which may be de- 
termined by substitution of (34) into (33) and integration over v? as 


(u— >) g, + nlu) | / Sy du + _| = 0(и— ш), | Im» £0; (35a) 
v 


while the odd part defines a transverse contribution g,(u, u’) determined on 
substitution of (34) into (33), multiplication by v*, and integration over v7, by 


(i— 98,4 - Т0 


а ар) | £,du—(u—w) Пау 0. (35b) 
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Notation is the same as in eqs (23) et seq. Integration of eqs (35) over и and 
elimination of the g, and £, integrals from (35) then leads to the explicit ex- 
pressions: 


gu, и) = fw- u)— 
u—?v 


ыс | Im» 5 0, (36a) 


1+ f byoofu— 9] du 
Alu’ — ») 
»— ep + SLA u— »)] du 


Use of the regular decompositions (23) permits one to extend eqs (36) to all » 
in the physical sheet, viz. 


&(u, и”) = 28 Е и) — I Im» z 0. (36b) 
и—7 | 


gy(u, и”) = + 2лї 0,(u — v) | e- u^) F 2лї M (37a) 
v є®(у) 
&(u, и”) = + 2xi 0(u — v) L и”) F 2ni көзе»! (37b) 
8*(v) 


From the completeness relation (11) and from (27) and (37a) one infers by 
means of the longitudinal part G33" of G33 the following longitudinal relation 


for fa: 
+* T 
Уу 5 f Gast do === si | феи) 


2лі 2лі 
с с 


which, on evaluation of the residue at the pole of g, and the branch-line con- 
tribution along Іт у = 0, becomes 


M n(u) Fo(v?) Т 
= "ws + Fo(v7) х 


n" 
x | sa б,(ф— и) + e-()8.6— 2| | 
y 


= о 


ó,(u—v) 8 (и —– ә) 
+0) 6 |” 
et(y ET (v 
(38) 
Comparison of (38) and (29a) then permits the identification of the discrete 
longitudinal > = 0 adjoint eigenvector as 


Ko 
wits 0 (39a) 
Lik 
and, on comparison with (29b), the continuous adjoint eigenvectors as 
ko 
тт —> 0 (39b) 


u Ф+(и)/ік 
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where 


à,(u— v) | à-(u— >] le^ p) 

e*(y) 20) J m0) na) 

One can now verify that the eigenvectors and their adjoints satisfy the bi- 
orthogonality property (5a). . 

The analytic continuation procedure used for finding (30) can be employed 
with the representation (12) involving G33" to obtain, via contour deformation 
and evaluation of the residue at the complex zero >; of e*(y) in the unphysical 
sheet, the longitudinal adjoint quasimode wavevector: 


ko 


Фу) = | 


Vg fie , (40) 


ik (u— »;*) 
as used in the representation (14). 

A procedure similar to the above, but based on the transverse part of G33 
and as well on G;,, leads to the determination of the transverse adjoint eigen- 


vectors as 
To 


€ ko X To 
Ч o». (41) 
v7 To 9,* (u) 
то 
where T, = Т, or Ty” and 
ó,.(u—v) | ó .(u—») 
$*() $70) - 
From the magnetic field G22 representation and (11) one also identifies a 
static magnetic field eigenvector with eigenvalue >» = 0 as 


9 yt(u) = 


0 
Y= + >l kh) N,=1. (42) 
0 


4, Quasiparticles 


The discrete oscillatory quasimodes considered above are distinguished by a 
complex frequency ор = ов — iy *). For each mode type f, knowledge of the 
k-dependent c, permits the determination of the phase velocity, w,/k, of a 


*) Note that Op is the real part of the complex frequency wp. 
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single К wave, or the group velocity Vi @, of a wavepacket centered at К. 
Fields excited by sources or evolving from a prescribed initial state usually 
give rise to wavepackets, each distinguished by a group velocity and wave struc- 
ture characteristic of the mode type. Finitely extended wavepackets can be 
regarded as composed of “point” wavepackets, or quasiparticles, each with a 
position r;(f) and momentum k,(¢). To elucidate this view we shall first employ 
the completeness relation (14) to decompose fields into their constituent quasi- 
mode types. Such quasimode representations are particularly useful for systems 
admitting a continuous spectrum of oscillatory eigenmodes. 

For example, in linear systems wherein the eigenspectrum is continuous, let 
(r, 0) represent an initially prescribed field at t = 0. From (14), on multi- 
plication by exp (ik.r) and integration over k, one obtains *) 


0)— Falk) k, 0 toc 43 
veo у as ep kn) а (43) 
p 


where 


ag(k, 0) = Í (00+, W w(r, 0)) exp (—i k . r) dr. 


The fth quasimode contribution to (43) at time / then follows from eqs (1) 
and (4) as 


yy ‚= [209 шо, t) exp Gk. r) —— (44a) 
Lk NK) ^ = = 
where 
(; = + s) адк, t) = 0. (44b) 
i ot 


Equation (44a) is descriptive of a wavepacket if the latter has formed by time f. 
A coarse but useful description of the evolution of this wavepacket as a func- 
tion of r, ¢ is provided by the “energetic” measure of y,(r,¢) given by the 
Hermitean inner product (ys, W Yẹ). To introduce this measure one defines 
the adjoint to the representation (44) of ys, obtained from (14), as 


, 


(2л)? 


ри t) = | PE) аз, 2) exp (ik. rs (45a) 
Where 
10 
(; —— so) ад, t) = 0. (45b) 
iot 


Using eqs (44), (45), and the normality property (5a), one finds with r" = r — r' 
and k” = Е К: 
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(эж. t), W v(t, 0) = f | a,(k, t) ag**(k’, t) exp [i (k . x" +k" or 


dk 


= | ЕК, r’, t) exp (ik . 1^) (om 


where the spectral measure Ар is defined by 


Fs т, £) = | а, г) ag**(&’, 1) exp К” « x) (47) 


(2m)? 

The spectral function F;(k, r, t) will be shown to be interpretable as a time- 
dependent density in a К, г phase space used to depict the dynamics of quasi- 
particles of momentum k(t), position r(t), and “energy” c. To deduce this 
interpretation we shall assume a weakly inhomogeneous background so that 
wp = (К, г) becomes weakly dependent on r; for simplicity of notation, the 
thereby implied dependence of ag and аз? on r will be left implicit. From 
eqs (44b) and (45b), one infers 


10 
[5 = + walks п) – ҖЕ, r] адк, t) аз** (К, г) = 0. 
1 


In successive steps one then obtains 


1d 
ff E + ex(k, r' + 1)— оК К”, r) | ад, t) ast (k^, t) х 
i 


d 
x exp (К.г k.r^)] Олд = 0 


and, on replacing k” by \/'/ї when acting on exp (і k” . г”), апа r” by V Ji 
when operating on exp (і К.г”) only or by — V ,/i when acting only on ар a5*, 
and using (47): 


exp(ik "|: + о (x r'— =)- w (к- ы r) ne r укын 
f opak. ia PUO i : i| JIO Om 

= 0, 
and by uniqueness of this Fourier transform, the defining equation for 7; be- 


comes 
19 V 
Е em + w(x, Y — EJ = w(x- TERE) : F,(k, r, t) = 0. (48) 
1 H І 


Setting w,(k, г) = оК, г) — i ув(К, r), one can expand (48) for systems 
wherein ор is weakly dependent on К, г as: 


d = = 
[2+ Velie vo Ve |a 19s (49) 
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Equation (49) is evidently a “kinetic equation for waves” (or quasiparticles), 
with higher-order diffusion terms in r and k-space omitted for simplicity. In 
its indicated form, (49) defines F;(k, r,t) as the К, г phase space density at 
time ¢ of quasiparticles of position r(?) and momentum k(t). Along the trajec- 
tory (“characteristic”) defined by 


dr. _ dk - 
aS V к Og, qM Wp (50) 
the indicated kinetic equation (49) becomes simply 
d 
ae =—2 ув Fs (51) 


which implies that, as one moves with a quasiparticle along the trajectory (50), 
quasiparticles are being annihilated at the rate 2y, per second. If Fj(k,r, 0) 
is chosen to be consistent with the prescribed initial condition y(r, 0), one can 
readily deduce therefrom F,(k, г, г) through solution of (49) by means of (50) 
and (51). Knowledge of how the quasiparticle phase space density evolves in 
r, ¢ then permits calculation of the evolution of the “envelope” of the wave- 
packet (44a) from (46b) by 


dk 
(yg, W ард) = J ЕК, г, t) Ол) ; (52) 


whose interpretations as a quasiparticle “fluid density" is apparent. 

It should be remarked that the analysis in eqs (46)-(52) is patterned on a 
procedure in turbulence theory wherein y; and a, are weakly correlated in r 
and k, respectively; in this stochastic analysis many of the approximations 
implied above are more evidently justified. 
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